Skip to main content

Full text of "Relativity Theory Of Protons And Electrons"

See other formats


DAMAGE BOOK 



^ 

w > ro 

m< OU_1 58925 5m 

^ CD X CO 



OSMANIA UNIVERSITY LIBRARY 

Call No. ^2,0 < ) Accession No, [^ ) 

Author 

Title 



4 
This book slwultf b* returned on or beforo the date 



^/ 
n or b 
last marked below, jp $? c 



RELATIVITY THEORY 

OF 
PEOTONS AND ELECTEONS 



LONDON 

Cambridge University Press 

FKTTJS& LANE 

NEW YORK TORONTO 
BOMBAY CALCUTTA MADRAS 

Macraillau 

TOKYO 

Maruzm Company Ltd 
All rights reserved 



KELATIVITY THEORY 

OF 

PKOTONS AND ELECTRONS 



by 
SIR ART II U R E I) I) I N (j TON 

M.A., D.Sc., LL.fiTRK.S:""" 

riuminn 1'rnJiMxtr al Jv/ ( <>in>inu <tnt{ Kriicnnn'Hlul 
I'lnlnioiilni in Ihc L'niwrsifti n( t'nnihrittu 1 ' 



CAMBRIDGE 

AT T1IE UNIVERSITY PRESS 
1036 



PRINTED IN GREAT BRITAIN 



PREFACE 

In this book I have endeavoured to give a connected account of a series of 
investigations in the borderland between relativity theory and quanjbi^H 
theory. It begins where my earlier book, The Mathematical Theory of 
Relativity, leaves off at the point where in our survey of nature we encounter 
the phenomenon of atomicity. To our gross senses matter seeing .continuous, 
and it has been treated as continuous in the usi^tl tftCory of relativity. 
Experiment has, however, taught us that it is composed of multitudes of 
units, and the theory is here extended to throw light on the existence and 
properties of these units. 

The central problem is to ascertain the conditions which fix the amount of 
mass and electric charge carried by protons and electrons. The present 
researches will probably be associated in the minds of many readers with 
the number 137; this is one of four numerical constants of nature for which 
the theory predicts definite values. Another fundamental constant, found 
to be 2.136.2 256 (approximately 3-150.10 79 ), can be described as the 
number of protons and electrons in the universe; but its practical importance 
is that its square root enters into the ratio of the electrical to the gravitational 
force between a proton and electron. The result of these determinations is 
that there are no arbitrary constants left in the scale of relations of natural 
phenomena. 

Besides giving concrete results of this kind, the theory has, I hope, thrown 
light on some of the obscure points in quantum theory and helped to deepen 
its foundations. I have sought a harmonisation, rather than a unification, 
of relativity and quantum theory. I do hot set out to obtain an all-embracing 
formula; but the investigation shows in detail how to combine the con- 
ceptions of the two theories in the solution of specific problems, which would 
be outside the range of either theory separately. 

The theory, as it was being developed, was published from time to time 
in the Proceedings of the Royal Society (121, p. 524; 122, p. 358; 126, p. 696; 
133, pp. 311, 606; 134, p. 524; 138, p. 17; 143, p. 327; 152, p. 253) and the 
Journal of the London Mathematical Society (7, p. 58; 8, p. 142) between 1928 
and 1935. But it has become increasingly difficult to deal with it in frag- 
ments. Much of it (including practically the whole of Chapters vi, xi and 
xvi) is now published for the first time. The new results of a practical kind 
include the theory of the Stern-Gerlach experiment, the theory of Bond's 
correction f|| to ejm, and the direct calculation of the number of particles 
in the universe. 



vi Preface 

In so extensive a work I cannot expect that serious mistakes have been 
entirely avoided. But now that the theory can be viewed as a whole, I think 
the reader will be convinced that there is a practicable way of progress along 
the lines I have attempted. I hope therefore that he will see in the im- 
perfections of this book an opportunity for developing, not an excuse for 
dismissing, the subject which it sets forth. 

Prof. G. Lomaitre and Prof. G. F. J. Temple have kindly read the book 
in proof. Their interest and criticism has encouraged me in the develop- 
ment of the theory, and I now owe them a further debt for many helpful 
suggestions. 

A. S. E. 

CAMBRIDGE, 
June 1936. 



CONTENTS 

INTRODUCTION l"*ge 1 

PART I. WAVE-TENSOR CALCULUS 

CHAPTER 1 Tensors and Matrices 13 

II The Sixteenfold Frame 20 

IK The Resolution of Matrices 34 

IV Space Vectors 60 

V Tho Simple Wave Equation 02 

VI Reality Conditions 75 

VII Strain Vectors and Phase Space 92 

Vlll The Differential Wave Equation 115 

IX The Hydrogen Atom 140 

X Double Wave Vectors 154 

PART II. PHYSICAL APPLICATIONS 

XI The Ricmann-ChristoffelTensor 179 

XII The Mass-ratio of the Proton and Electron 212 

XIII Standing Waves 229 

XIV The Cosmical Problem 256 
XV Electric Charge 281 

XVI The Exclusion Principle 308 

INDEX 331 



INTEODUCTION 

0*1. In 1928, P. A. M. Dirac made a bridge between quantum theory 
and relativity theory by his linear wave equation of the electron.f This 
is the starting point of the development of relativity theory treated in 
this book. 

Previously there had been three principal stages of progress, namely 
Einstein's special theory (1905), his general theory (1915), and Weyl's 
theory of relativity of gauge (1918). Summarising the state of the theory in 
1923, I wrote J 

We offer no explanation of the occurrence of electrons or of quanta; but in 
other respects the theory appears to cover fairly adequately the phenomena of 
physics. The excluded domain forms a large part of modern physics, but it is 
one in which ail explanation has apparently been baffled hitherto. The domain 
here surveyed covers a system of natural laws fairly complete in itself and 
detachable from the excluded phenomena, although at one point difficulties arise 
since it comes into close contact with the problem of the nature of the electron. 

Relativity theory was in fact as comprehensive and as logically complete 
as a purely macroscopic theory had any right to be. The next important step 
must be an extension to cover microscopic phenomena, or a unification with 
existing microscopic theories. 

Microscopic physics was the province of quantum theory; but in 1923 this 
was little more than a collection of empirical rules which led to no coherent 
outlook. The "new quantum theory" began with Heisenberg'n researches 
in 1925, and with the aid of many contributors it reached soon afterwards 
the current form generally called wave mechanics. The conditions were 
becoming ripe for a unification with macroscopic relativity theory. 

To say that Dirac's wave equation was the first connecting link gives only 
a partial idea of its importance. It was a challenge to those, who specialised in 
relativity theory. Dirac's object was to obtain a form of equation (fulfilling 
certain requirements of quantum theory) which should be invariant for 
rotations and Lorentz transformations. We had claimed to have in the 
tensor calculus an ideal tool for dealing with all forms of invariance and 
covariance. But instead of using the orthodox tool Dirac proceeded by a 
way of his own, and produced an expression of very unsymmetrical appear- 
ance, which he showed to be invariant for the transformations of special 
relativity theory. Why had this type of invariance eluded the ordinary 
tensor calculus? As C. G. Darwin put it, "it is rather disconcerting to find 

t Proc. Hoy. 8oc. A, 117, 610 (Feb. 1928). 
j Mathematical Theory of Relativity, p. 237. 



2 Introduction [<M 

that apparently something has slipped through the net ".f It was Darwin's 
insistence on this point in private conversation which led me to take up 
these investigations. 

The failure of ordinary tensor calculus to include Dirac's type of invariance 
is due to the introduction, at an early stage, of a convention whose arbi- 
trariness had already been noticed. J The analytical theory of tensers had 
been applied to physics by identifying its basic vector with a geometrical 
displacement (dx)P. By a change of application, namely by identifying the 
basic vector with Dirac's four-valued quantity 0, we obtain a new tensor 
calculus, here called wave-tensor calculus, in which the invariance of the 
wave equation falls into order. Formulae have to be found for expressing 
the old tensors (space tensors) in terms of the wave tensors; and this leads 
to a chain of new developments which have no counterpart in the tensor 
calculus of ordinary relativity theory. 

I was soon convinced that this was the extension of relativity theory for 
which we had been waiting, and that Dirac's equation was only the beginning 
of a more far-reaching application of the methods and conceptions of 
relativity theory to microscopic phenomena. After seven years' work I find 
the possibilities latent in the new departure still far from exhausted. 

Naturally others besides myself were attracted to the new opening. 
Allowing for divergences in the point of view, my first paper, dealing with 
formal developments including the rudiments of wave-tensor calculus, was 
perhaps not materially different from several other investigations published 
about the same time. || But a month or two later I camo across a clue to the 
origin of the charge of electrons and protons, ^f The trail has led all round the 
universe, so that the subject with which I began comes almost at the end of 
this book (Chapter xv). Ultimately the problem of the origin of charge was 
found to be inseparable from the problem of the origin of mass. I was thus 
led into a special field of investigation which, I think, has not been explored 
by other writers. 

Dirac's wave equation has led to important advances in quantum theory; 
but here we shall be working mainly on the relativity side of the bridge. It is, 
of course, impossible to treat protons and electrons without introducing a 
considerable amount of quantum theory. But its subordinate position will 
be apparent from the fact that the problems treated in Part II of this book 
are not touched upon in books on quantum theory; they depend essentially 
on developing the consequences of the relativistic conception. 

f Proc. Roy. Sac. A, 118, 664. This paper was of great assistance in my early work. 

j Mathematical Theory of Relativity, p. 49. Proc. Roy. Soc. A, 121, 524. 

|| I think that the most far-reaching, as well as the earliest, paper of this type was by 
J. v. Neumann, Zeits.fur Physik, 48, 868. But I have been more influenced by H. Tetrode, 
ibid. 50, 346, whose point of view was less unfamiliar to me. 

If "The Charge of an Electron", Proc. Roy. Soc. A, 122, 358 (Dec. 1928). 



0-2] Introduction 3 

0*2. As the work proceeded, it became focused on one problem, namely 
the origin of the four numerical ' c constants of nature ' ' . Seven fundamental 
constants are commonly recognised: 

m e the mass of an electron, 

m p the mass of a proton, 

e the charge of an electron, 

h Planck's constant, 

c the velocity of light, 

K the constant of gravitation, 

A the cosmical constant. 

Between these we must eliminate our arbitrary units of length, time and 
mass; we are then left with four purely numerical ratios. The most familiar 
are the mass-ratio m p /m e> and the fine-structure constant Ac/2?re 2 ; these are 
found in Chapters xir and xv. The value of *c, i.e. its ratio to a constant of 
similar dimensions furnished by the other constants, is obtained in Chapter 
xiv. Also, with the help of the other constants we replace A by a number N, 
the " number of particles in the universe", whose theoretical value is found 
in Chapter xvi. Thus all four constants are obtained by purely theoretical 

calculation.! 

The number of dimensions of space-time might be regarded as a fifth 
natural constant. Even this number is found to be determined unam- 
biguously by the epistemological principle that we can only observe 
relations between two entities ( 16-8). At a much earlier stage (Chapter vi) 
we prove that a four-dimensional neutral domain necessarily has the 
signature 3+1. 

So far as I can make out, the values of the constants given by this theory 
are in full agreement with observation. For three of the four constants the 
observations are accurate enough to provide a very stringent test. It would 
have been disconcerting if it had turned out otherwise; but the theory does 
not rest on these observational tests. It is even more purely epistemological 
than macroscopic relativity theory; and I think it contains no physical 
hypotheses certainly no new hypotheses to be tested. All that we require 
from observation is evidence of identification that the entities denoted by 
certain symbols in the mathematics are those which the experimental 
physicist recognises under the names "proton" and "electron". Being 
satisfied on this point, it should be possible to j udge whether the mathematical 
treatment and solutions are correct, without turning up the answer in the 
book of nature. My task is to show that our theoretical resources are 

t A general account of the principles on which the calculations are based is given in 
New Pathway* in Science, Chapter XL 

1-3 



4 Introduction [0-2 

sufficient and our methods powerful enough to calculate the constants 
exactly so that the observational test will be the same kind of perfunctory 
verification that we apply sometimes to theorems in geometry. 

The replacement of four empirical natural constants by calculated num- 
bers implies a unification of theory. Tri Maxwell's unification of electro- 
magnetism and optics the ratio of the electromagnetic to the electrostatic 
unit of charge was found to be equal to fche velocity of light ; similarly in the 
unification of macroscopic and microscopic theory the macroscopic con- 
stants K, A are found to be expressible in terms of m p ,m e ,e. The elimination 
of superfluous constants is an outward sign of the unification achieved; and 
for this reason I have regarded it as the first goal. But the theory that has 
taken shape in these investigations should supply a foundation for the 
treatment of other microscopic problems for which current quantum theory 
is insufficient. I have not pursued these further developments, partly because 
they often require a knowledge of the more technical side of quantum theory 
which I do not possess, and partly because the completion of the calculation 
of the four natural constants has seemed an appropriate stage at which to 
assemble the theory into a connected form. By way of exception, I have 
applied the theory to the Stern-Gerlach experiment (12-8); the result 
agrees with observation. 

0-3. The marriage of relativity theory and quantum theory should be a 
fruitful union as well as a formal union. In regard to the numerous formal 
unified theories that have been suggested, we may recall 

There are nine and sixty ways of constructing tribal lays, 
And every single one of them is right ! 

But T think they have been inspired by a fundamentally different conception 
of the problem of unification from that which I shall follow. There is a con- 
siderable amount of formal theory in this book; but it has been developed 
concurrently with the physical theory in Part II; and its progress has been 
guided as much by the definite applications, in which it was to be used, as 
by formal considerations. 

A unified theory does not necessarily mean a unified formula. The latter 
kind of unification is exemplified by the theory of the "Generalised Astro- 
nomical Instrument" which combines in a single equation the theory of the 
altazimuth, meridian circle, prime vertical instrument, equatorial and 
almucantar.f Such compression appeals more to the mathematician than to 
the physicist. We do not aim at producing a formula which shairinclude 
simultaneously the irregular gravitational fields of general relativity and 
the quantised energy of an atom. We seek instead the common meeting 
point from which the specialised developments and approximations appro- 
t ManMy Notices, H.A.S. 68, 171. 



0-4] Introduction 5 

priate to the gravitational and the atomic problem branch off. The source 
of quantum phenomena is a degeneracy, or exceptional integrability, which 
is associated with uniformity and symmetry, and invalidates the assumptions 
underlying the ordinary theory of macroscopic averages. It is therefore in 
uniform conditions (spherical space) that the linkage of quantum theory to 
macroscopic relativity theory must primarily be studied. 

For this reason we have not much to do with the formulae of general 
relativity, though we have much to do with its principles. We generally treat 
space-time of uniform curvature either the de Sitter form hyperbolic in 
the time dimension, or the Einstein form cylindrical in the time dimension, 
the space being spherical in either case. Thus, on the relativity side, we halt 
at a stage intermediate between the general theory and the special relativity 
theory of flat space. 

For the same reason, when gauge transformations are employed, the 
formulae used are those of Weyl's theory, not the author's generalisation 
of it.| The generalisation would be required if we dealt with space-time of 
irregular curvature; it coalesces with Weyl's theory in the uniform conditions 
here considered. 

With flat space-time we have nothing to do. The theory of space-time 
will here be developed pari passu with the theory of the material systems 
which occupy it. in this mode of approach the conception of infinite flat space 
never arises; it could not be brought into the theory except as a limit that 
might be approached but never attained. But in the same way the concep- 
tion of definitely empty space never arises; it could not be brought into the 
theory except as a limit that might be approached but never attained. 
(By definitely empty we mean that the probability of containing a particle or 
photon is zero.) Our rejection of flat space-time does not depend on the view 
that definitely empty space has a natural curvature determined by the 
cosmical constant.^ Space appears in our theory as the domain of the pro- 
bability distribution of a particle, so that it is an essential characteristic 
of space that it is occupied or has a finite a priori probability of being 
occupied; and it is non-controversial that it will have a curvature (or an 
expectation- value of the curvature) corresponding to the energy tensor (or 
expectation-value of the energy tensor) of its contents. 

0'4. I think it will be found that the theory is purely deductive, being 
based on epistemological principles and not on physical hypotheses. But 
it could not be presented in purely deductive form which would mean, 
I suppose, that it was treated as an investigation in pure mathematics with 
a physical denouement in the last chapter. It has seemed essential to 

t Mathematical Theory of Relativity, Chapter vu, Pt. II. 
j We are led to reject this view ( 11*7). 



6 Introduction [0-4 

keep the physical applications in mind throughout; for this purpose 
results must sometimes be anticipated which are not reached by deduc- 
tion until much later in the book, and interpretations must be employed 
which are not definitely established until the whole theory is connected 
together. 

This method gives rise to certain difficulties. For mathematical seasons 
we have to begin with the simplest equations; but these correspond to 
highly idealised systems to which the ordinary physical conceptions only 
partially apply. It is indeed obvious that a system must attain a consider- 
able degree of complexity before anything remotely resembling the ordinary 
method of observation is applicable to it. Consequently the physical ideas 
can only take shape gradually as we proceed. Space-time of a kind first 
appears in Chapter iv; but it has the wrong signature, and its scale is much 
too small. These defects become rectified as the developments in later 
chapters take us closer to actuality. So with most of the physical concep- 
tions; we .have to introduce preliminary notions before the theory is suffi- 
ciently advanced for a full treatment. The reader will probably find that 
many of the difficulties that occur to him in reading the earlier chapters 
arise from the unnatural conditions postulated in the most elementary 
equations, and that they resolve themselves automatically when the theory 
reaches more realistic problems. 

Those who are expert in quantum theory should bear in mind that we are 
proceeding from another starting point from that usually adopted. It 
would be foreign to my intention of developing the theory as a pure deduction 
from relativistic principles to transfer conclusions, however widely accepted, 
from the usual quantum theory which contains a large empirical element. 
Nevertheless, I make frequent appeals to current quantum theory for three 
purposes. Firstly, because it contains the definitions of the quantities with 
which I am concerned. It would be impossible to make a theoretical deter- 
mination of the constant known as the "mass of an electron" without an 
examination of the equations by which the quantum physicist has chosen 
to define it. Secondly, where the present theory coalesces with current 
theory, it is unnecessary to repeat purely analytical investigations which 
equally apply to either theory. Thirdly, certain results (especially the 
Exclusion Principle), which cannot be treated until late in the book, have 
been borrowed from current theory in anticipation. 

It may be well to make it clear that although the present theory owes 
much to Dirac's theory of the electron, to the general coordination of quan- 
tum theory achieved in his book Quantum Mechanics, and to the many 
contributions of himself and others on these lines, it is not " Dirac's Theory " ; 
and indeed it differs fundamentally on most points which concern relativity. 
It is definitely opposed to what has commonly been called "relativistic 



0-5] Introduction 7 

quantum theory ", which, I think, is largely based on a false conception of 
the principles of relativity theory. 

Atomic nuclei and free neutrons are outside the scope of this book. I see 
no reason to fear that they will not fall into place in the theory; but I have 
not developed any ideas on this point far enough to be worth recording. In 
the main the theory of radiation has also been excluded; but there are three 
short references ( 9-5, 14-3, 16-3) which show how it might be approached 
in the present treatment. 

0* 5. The division of the book into two parts, the one ostensibly treating 
the auxiliary mathematical calculus and the other the physical applications, 
is only a rough separation. Physical interpretations are considered as early 
as possible; but with the introduction of double wave tensors in Chapter x 
the relation of the mathematics to the physics changes considerably. 
Instead of starting with the mathematical result and interpreting it as far 
as possible physically, we start with the physical problem and formulate it 
mathematically. The auxiliary mathematical development still continues; 
but it is now guided by the character of the physical problems for which its 
aid is required. 

The student of relativity theory may well feel a grievance at the turn 
which the auxiliary mathematics has taken. The macroscopic theory seemed 
to indicate that Differential Geometry was the key to world-structure. After 
being at pains to acquire some familiarity with this subject, we find that all 
the new advances depend upon modern Algebra. The algebra required in 
the present book is developed practically ab initio by old-fashioned methods 
which, I fear, betray my limitations as an algebraist/though they may make 
the theory more accessible to those most interested. Let me freely admit that 
ability to use the more powerful modern algebraic methods would be an im- 
mense advantage in handling these problems. For the kind of algebra chiefly 
required I have found most helpful C. C. MacDuffee, The Theory of Matrices J[ 

A few remarks on terminology, etc. may be useful. I would direct special 
attention to the limited use of the summation convention (p. 22), to my 
unorthodox use of the term "algebraic" (p. 20), and to the change in the 
order of writing the suffixes of the Riemann-Christoffel tensor (p. 181). 
I would emphasise that wave analysis is a method, not a theory, and may be 
applied to any physical tensor; therefore statements about the physical 
meaning of the various products of wave analysis necessarily refer to some 
special application (singled out by custom) and are not of general validity. 

The term universe is used so often as perhaps to suggest megalomania. 
It is really the opposite of megalomania, for it takes the place of infinity in 
elementary wave mechanics. Mathematically it is much easier to treat a 

t Ergebnisse der Mathematik und ihrer Grenzgebiete (Julius Springer, 1933), 



8 Introduction L' 5 

whole universe than part of one the universe being, of course, idealised to 
accord with the simple conditions postulated in elementary problems. It is 
more elementary to suppose that the uniform conditions continue inde- 
finitely than to terminate them by a physical barrier; supernatural barriers 
are often misleading, and should be avoided if possible. In the earlier 
chapters there is sometimes a difficulty in deciding whether our equations 
refer to an electron lor to the universe. But the fact is that the electrons treated 
in ordinary elementary quantum theory are very much like the universe 
only bigger. They are said to be "infinite plane waves". No doubt it is 
intended that they shall be replaced by waves of more reasonable dimensions 
in practical approximations; but this applies also to our theory. For applica- 
tions in which a millimetre is a good enough approximation to infinity, it is 
a fortiori a good enough approximation to 400 megaparsccs. 

By a particle, I mean, not a classical particle, but a conceptual entity 
whose probability distribution is specified by a \vave function. At different 
stages in this book, different applications of wave analysis are made; and the 
corresponding particles have different properties. In the earlier chapters 
the particles are rudimentary protons and electrons existing in the rudi- 
mentary space-time there treated. They gradually develop into recognisable 
electrons and protons in macroscopic space-time, when the theory is ex- 
tended far enough to introduce the observable relations by which protons 
and electrons are known experimentally. The reader should therefore not 
be surprised to find that initially the positive and negative particles have 
completely symmetrical properties; that is merely another illustration of 
the fact that the most elementary equations imply highly idealised con- 
ditions to which the ordinary conceptions of physics only partially apply. 

0*6. Those who have followed the progressive development of the 
theory during the last eight years may desire a comparison of the present 
revised theory with earlier versions. The papers are numbered for reference 
as folio ws:f 

1 . "A Symmetrical Treatment of the Wave Equation " , R.S. 121 , 524, 1 928 
Tl. "The Charge of an Electron", R.S. 122, 358, 1928. 

III. "The Interaction of Electric Charges", R.S. 126, 696, 1930. 

IV. "The Properties of Wave Tensors", R.S. 133, 311, 1931. 

V. "The Value of the Cosmioal Constant", R.S. 133, 605, 1931. 
VI. "The Mass of a Proton", R.S. 134, 524, 1931. 
VTI. "Sets of Anticommuting Matrices", M.S. 7, 58, 1931. 
VIII. "Theory of Electric Charge", R.S. 138, 17, 1932. 
IX. "The Factorisation of ^-Numbers ", M.S. 8, 142, 1933. 
X. "The Masses of the Proton and Electron", R.S. 143, 327, 1933. 
XI. "The Pressure of a Degenerate Gas, and Related Problems ", R.S. 152, 
253, 1935. 
t M.S. refers to Proc. Roy. Soc. and M.S. to Journ. Land. Math. Soc. 



0-6] Introduction 9 

I will begin with a definite withdrawal. In I it was suggested that the 
adjoint tetrads A\, E 2 , E z , E and JS? 15 , JK 25 , jB 35 , # 45 correspond to electrons 
of opposite spin, and this niisjudgment persisted in 111. The present view 
( 4-3, 4-4) of the role of E 5 was first reached in IV. I think there is no 
other point on which I went so completely astray; the other lines of 
development begun in this series of papers, though sometimes requiring 
substantial amendment, contain advances which in principle have been 
retained. The papers fall into three groups: 

(a) Auxiliary Mathematics (I, IV, VIT, IX). The ^'-symbols were at 
first defined so that A^ 2 = 1 ; the present notation, with E^ 1, begins 
in IV. In IV the mathematical development remains satisfactory; but 
the physical interpretation was confused, because a degenerate wave 
tensor was used in a context where the later results substitute a non- 
degenerate wave tensor. The proofs of two important theorems in VII 
and IX, namely the composition of a pentad by three imaginary and two 
real matrices, and the standard forms of pure wave tensors, have been 
found to be imperfect. Amended proofs are given in 3-5, 5-5. 

(b) Electric Charge (11, III, V11I). Paper II has been affected less than 
most of the early papers by subsequent progress, and can be regarded HH 
substantially correct so far as it goes. Gauiit's form of the matrix co- 
efficient of the Coulomb energy, employed in the paper, is now obsolete; 
but this scarcely affects the investigation at the stage concerned. The 
factor 136 was changed to 137 in subsequent papers; but the difference 
is a question of definition ( 15-9). Paper 111 represents an interim stage 
in a complicated investigation, and has the defects of an interim report. 
Progress remained unsatisfactory until the interchange energy was associatec 1 
with the operator P instead of with E$. The theory in Vlli is substantially 
the same as the second of the two methods given in this book ( 15-7). 

(c) Origin of Mass (V, VI, X, XI). With regard to V and VI, which 
were preliminary papers, 1 need only say that the present theory follows 
the ideas there suggested. In X the argument now replaced by the 
formula jR 2 = JS./2' was unsatisfactory; otherwise the changes are mainly 
of the nature of amplification. Paper XI is practically up to date ; but 
a numerical change was made necessary by the discovery of an in- 
consistency of a factor 2 in current quantum theory ( 9-6). 

The various lines of investigation were very much interlocked ; a back- 
ward state of one prevented progress in the others. Thus the whole work 
reached completion as one unit. The cosmical problem treated in XI was 
the last item on the main programme; and, after it was solved, there was 
not much difficulty in supplying the remaining investigations needed to 
fit together all the material. The investigations in the published papers 



10 Introduction [0-6 

are, of course, considerably altered in form now that they are connected 
to a homogeneous theory instead of to fragments of current theory 
partially modified to suit the growth of ideas. 

It was found in III that the theory offers an explanation why one 
dimension of the world differs from the other three; but, except for this, 
little attention was paid to reality conditions in the series of papers. 
This was deliberate; because it seemed premature to try to formulate 
reality conditions before the main lines of connection of the analytical 
theory with observational phenomena were settled. It was not until after 
the last of the published papers that I took up the problem and reached 
the reality conditions formulated, and used extensively, in this book. 

Strain vectors first appeared in X, but were to some extent anticipated 
in VIII. The more extended use of strain vectors, and the systematic 
discrimination between internal and external wave functions, is a feature 
of the new treatment. Other portions of the theory scarcely touched on 
in the published papers are 8-4-9-1, 10-4-11-9, 12-6-12-8, 15-8-16-9. 

At one time 1 laid stress on a suggestion (due to Zanstra) that the 
packing ratio in helium is an approximation to f|. It seemed likely that 
the binding of particles in a rigid nucleus might be represented as the 
loss of one degree of freedom of the double wave functions, with a 
corresponding reduction of the energy required for statistical equilibrium. 
Recent atomic weights make the packing ratio less close to J-|| than was 
at one time supposed; but in any case the theory of a rigid nucleus was 
not expected to apply to helium exactly. I still regard the suggestion as 
plausible; but as my investigations have not dealt with nuclear structure, 
the question remains in suspense. 



PAET I 
WAVE-TENSOR CALCULUS 



CHAPTER I 
TENSORS AND MATRICES 

1*1. Linear Transformations . 

A physical system may be described in many alternative ways. Different 
systems of coordinates may be used for specifying its position; different 
systems of units may be used for the measurement of mass, length, time; 
and so on. Accordingly our attention is directed to the problem of comparing 
systems of description in which there is a one-to-one correspondence between 
quantities A, B, C, ... occurring in one description and quantities .4', B 1 ', 
C', ... occurring in another description. 

The description commonly includes sets of associated quantities which 
are regarded as "components" of a single entity, e.g. the three components 
of a force. We then have a correspondence between an array of n quantities 
Ap in one description and A^ in another description (/u = 1, 2, ... n). 

We proceed at once to a special case of great importance, viz. when A ' is 
given by a linear transformation of A^ 

...+q lH A n [ 

...+?i,i4J 

etc. Using the summation convention of the tensor calculus, these formulae 
are written more compactly A > A /i i\ 

r A a =9oii A n (M2) 

and the transformation, or change of description, is described as A -> g A . 

The array of coefficients q afJL defines the change of the system of descrip- 
tion, so far as the characteristic A^ is concerned. Linear transformations 
possess the Group property ; that is to say, the resultant of a succession of 
linear transformations is a linear transformation. Thus we can have a set 
of systems of description such that, in passing from any one description to 
any other, the transformation of A^ is always linear. When for all systems 
of description contemplated the transformation of A^ is linear, A is called 
a tensor. 

By solving equations (Ml) we can find A l9 A 2 , ... in terms of -4/, A 2 ' , .... 
The resulting formulae are linear and may be written 

4r = ?o/-V- (M3) 

The array of coefficients q a ^ defines the in verse transformation to that defined 

fcysv 

If Bp is another array of n quantities occurring in the description of the 
physical system, and in the change of description in which A Q ->q aiL A tli 



14 Wave-tensor Cakulus [M 

Bp is said to be a tensor of the same kind as A^ (or to be cogredient with 

4>- 

If Cp is another array of n quantities occurring in the description, and in 
the change of description in which A a ->q afJt A^ 

C.+qr'Cp, (M5) 

C^ is said to be a tensor of opposite kind to A p (or to be contragredient to A^) 9 
Note the inversion of the order of the suffixes of q'. 

From Ap and C^ we may form an array of n 2 quantities A^G V which 
follows the transformation law 

^C^q^A^C^q^q^A^. (M6) 

If T^ v is an array of n* quantities occurring in the description, and in the 
change of description in which A^-^q^A^ 

T^q^'T^ (M7) 

(i.e. if it is transformed in the same way as A^C^ then T^ v is said to be a 
mixed tensor of the second rank of the class A^ . 

Tensor properties do not necessarily depend on the physical nature of 
the entity that is being described; they depend on the variety of descriptions 
which we admit. For example, the statement that B^ is a tensor of the same 
kind as A^ announces a limitation of the variety of description contemplated ; 
for there can be no compulsion to change our description of one physical 
feature of the system when the description of another feature is changed. 
But unless there is some systematic plan underlying our descriptions it will 
be impossible to assert any general laws governing the quantities occurring 
in the descriptions. 

For example, the strength of the wind is sometimes described by a 
number of dynes per square centimetre and sometimes by a number on the 
Beaufort scale. We cannot expect to find exact equations (relating our 
measures of the strength of the wind to other meteorological characteristics) 
applicable to both codes of measurement. By taking the wind strength to be 
a tensor of the class of tensors used for describing other meteorological 
characteristics we rule out one or other description not as illegitimate, but 
as unsuited to the purpose we have in mind, viz. to express the regularities 
underlying natural phenomena by mathematical equations governing the 
quantities which occur in our descriptions of the phenomena. 

1-2. Space Tensors and Wave Tensors. 

When the change of system of description includes a change of coordinates 
from (^ , a? 2 , # 3 , # 4 ) to (#/ , # 2 ' #3' > #4')* &* 1 infinitesimal coordinate difference 
dXp is transformed according to the formula 

dxi (1-21) 

* 



1-3] Tensors and Matrices 15 

etc. This may be written in the form (1-15) 

dx^q^'dXp (q^'-dx.'/dxj. (1-22) 

Thus every change of description contemplated as admissible corresponds 
to a linear transformation of dx^. Accordingly dx^ is a tensor; we call it 
a displacement vector. 

This is the basic tensor of the class of tensors used in the ordinary tensor 
calculus. Displacement vectors and all tensors of the same kind are called 
contravariant vectors; tensors of opposite kind are called covariant vectors. 
Mixed tensors of the same class are defined as in (M7); and more generally 
tensors of higher rank with 16, 64, 256, ... components are introduced, their 
transformation laws being 



We shall call this class of tensors space tensors. 

Thus, although the theory of tensors belongs primarily to the algebraic 
theory of transformations, it has usually been linked to geometry by 
identifying the basic tensor of the algebraic scheme with a geometrical 
displacement or coordinate difference dx^ . We shall here discard this special 
linkage. We shall introduce another class of tensors called wave tensors, 
derived from a basic contravariant wave vector ^ in the same way that the 
space tensors are derived from the basic contravariant space vector dx^ . 

For the moment we leave the basic wave vector unidentified. But at a 
certain point in the development of the system of wave tensors, we shall be 
able to side-step into a new class of tensors. On examining the properties 
of the new tensors we shall find that they can be identified with space 
tensors. Thus the wave-tensor calculus leads up to the ordinary space-tensor 
calculus and includes it as a side branch; but its greater comprehensiveness 
fits it to deal with certain entities in modern quantum theory which are not 
describable by space tensors. 

The basic wave vector will be identified in Chapter v. It turns out to be 
the four-valued wave symbol introduced into physics by P. A. M. Dirac in 
his linear wave equation of the electron. Vectors of this class cannot be 
reached from the ordinary calculus 6f space tensors, which does not begin 
far enough back. Our plan accordingly is to begin with these vectors, and 
lead up to the ordinary space vectors at a later stage. 

1*3. Chain Multiplication. 

Let Aj, Bj be two mixed tensors of the second rank. Having regard to the 
summation convention we recognise four different products 

A*B* 9 AJB V \ A*Bf % AjBf. (1-31) 

The first is the outer product, and the fourth is the inner or scalar product. 



16 Wave-tensor Calculus [1-3 

The second and third are called matrix prodticts and are denoted by AB and 
BA respectively. 

Matrix products are formed by chain multiplication, i.e. the second suffix 
of one factor is repeated as the first suffix of the succeeding factor (the 
repetition introducing a summation in accordance with the summation con- 
vention). The product AjB* is of this form. AjBf- is not a chain product 
as it stands; but it becomes one if it is rewritten as BfAj . 

On the understanding that chain multiplication is the only kind of 
multiplication admitted, no suffixes need appear in the formulae, since the 
reader can always supply appropriate suffixes when required. Thus the 
product of a number of double-suffixed quantities is written 

P = ABCD, (1-321) 

which stands for P^ABjCD. (1-322) 



This rule of multiplication is the distinctive feature of the matrix calculus. 
The notation is so useful that wo cannot afford to do without it. Nevertheless 
matrix calculus suffers from being more limited than tensor calculus; and 
we often want to introduce outer and scalar products and other combina- 
tions for which matrix calculus provides no notation. This necessitates 
resorting to various awkward shifts, and occasionally reverting to the full 
suffixed expressions. 

Chain multiplication does not contemplate quantities with more than two 
suffixes. We shall at first limit the term Sk matrix " to two-suffixed quantities 
representing two-dimensional arrays. Technically one-dimensional arrays 
are also matrices, but it would probably be confusing to include them. One- 
dimensional arrays will here be called vectors, even when no question of 
transformation properties arises. The term implies very little restriction so 
long as we do not specify the kind of vector. 

Chain multiplication cannot be carried beyond a vector, so that vectors 
can only occur at the beginning or end of a matrix product. We shall dis- 
tinguish initial vectors by an asterisk, final vectors being unmarked. This 
notation allows us to reintroduce outer multiplication to a limited extent. 
The rule is that, if it is impossible to interpret two symbols in juxtaposition 
as a chain product, they are to be interpreted as an outer product. Thus if 
0^ is a vector, the expressions 



are interpreted as (A$) xB, Ax 

where the symbol x indicates outer multiplication, chain multiplication being 

impossible after a final vector or before an initial vector. Or, with suffixes, 



1-4] Tensors and Matrices 17 

In particular, we have the following notation which is of great importance 

0X* denotes the outer product ^uX 



t \ ***t5 J 

denotes the scalar product x^u-J 

The asterisk is a substitute for suffix indications, and is dropped when the 
suffixes are inserted. 

A feature of matrix multiplication is that it is non-commutative; that is to 

sa y BA*AB. (1-34) 

It is to be remembered that the non-commutation only arises through the 
omission of suffixes; when suffixes are inserted in BA, the factors commute 

as usual. Thus iy*4/-4/V- ( 1>35 ) 

Since the suffixes are often omitted, we can no longer depend on dis- 
criminating contravariant from covariant vectors by the upper and lower 
positions of the suffixes. There would be little advantage in retaining a 
method of discrimination which only worked spasmodically. Accordingly, 
we shall in future generally write all wave tensor suffixes in the lower 
position. 

1*4. Transformation Laws of Wave Tensors. 

In 1-1 we introduced three kinds of tensors of the class A^ with trans- 
formation laws (1-14), (M5) and (1*17) respectively. The formulae may be 
written as B' aB C' = Ca ' T '= ' 

^a qa, J '> ^a u ixor > - L ar 



The products, as here written, are all chain products, so that the suffixes 
may be omitted and we have 

B' = qB, C*' = C*q', T' = qTq'. (1-41) 

Further by (1-12) and (MS) 



Therefore AJ = q^q^'A^ . (1-421) 

But A ' = *^A r ' 9 (1-422) 

where 8^ is the substitution operator, viz. 



= 0, 

Since A r f is an arbitrary array of four numbers, it follows from (1-421) and 
(1-422) that g^/^. (1-44) 

The left-hand side is a chain product; we can therefore drop the suffixes, 
obtaining ?? , = s (1 . 45) 



18 Wave-tensor Calculus [1-4 

In matrix calculus 8 has the algebraic properties of the number 1. For, 
if 8 is any matrix 88=8 88=8 

so that, dropping suffixes, 8/9 = $, 88 = 8. 

Accordingly 8 is called the unit matrix] and since it is equivalent to the 
number 1 in matrix calculus, we shall often denote it by 1, or with suffixes 
(1) . Then (1-45) becomes qq' = 1. Thus q' may be called the reciprocal of q, 
and it will sometimes be written as q- 1 . 

The formulae (1-41) and (1-45) constitute the principal transformation 
formulae in wave-tensor calculus. Summarising our results for reference, 
and changing to the notation which we shall usually employ, we have the 
following classification and nomenclature: 

Covariant (final) wave vectors 

i/j' = qtfj. (1-461) 

Contravariant (initial) wave vectors 



Mixed wave tensors T' = q Tq', (1-463) 

with ff?' = l- (1-464) 

These would reduce to the transformation laws of the ordinary tensor 
calculus if we set ^ = a.yaV, ^' = ^7^. <1') 

But, as already explained, the wave tensors are not linked to geometry in 
this way, and (1*47) does not apply. For a transformation of wave tensors, 
any matrix which has a reciprocal maybe used as q; that is to say, the corre- 
sponding transformation will give a new description which is included in 
the whole group of descriptions contemplated. 

A matrix which has no reciprocal is said to be singular. A singular matrix 
may be regarded as a generalisation of the algebraic number in much the 
same way that the unit matrix is a generalisation of the number 1 ; but there 
are infinitely many different singular matrices. As q approaches a singular 
value, one or more elements of its reciprocal q' tend to infinity; singular 
matrices q are therefore excluded in the foregoing transformation theory. 

1-5. Initial and Final Wave Vectors. 

The terms "initial" and "final", applied to wave vectors, define their 
behaviour in regard to chain multiplication, and do not necessarily describe 
their actual position in the sequence of factors (cf. (1-33)). As far as 
Chapter vi (inclusive) the initial vectors will be contravariant and the final 
vectors covariant. But it must not be supposed that this is a general rule, or 
that the asterisk is a symbol for contravariance. 



1-5] Tensors and Matrices 19 

In order to express the covariant transformation law (1*461) in a form 
appropriate for an initial covariant vector, we introduce a matrix q which is 
the transpose of 0, obtained by interchanging rows and columns; thus 

Then (1-461) stands for 

$* 

Hence, dropping suffixes, 0*' = 0*</. Treating (1-462) similarly, we have the 
transformation laws: 

Initial covariant wave vectors 

t*' = *li*q, (1-521) 

Final contravariant wave vectors 

x ' = q' x . (1-522) 

The outer product ift<f>* of two covariant wave vectors 0, $ is a covariant 
wave tensor S. Using (1-461) and (1-521) we obtain the transformation law: 

Co variant wave tensors S' = qSq. ( 1 53 ) 

These formulae will not be required until Chapter vn. 



CHAPTER II 
THE SIXTEENFOLD FRAME 

2* 1 . Symbolic Calculus . 

For our physical applications the significance of a matrix is embodied, not 
so much in its representation as an array of numbers, as in its non-commu- 
tative multiplication property (1*34). Most, if not all, of the properties of 
matrices which make them suitable for describing the conditions and 
activities of the physical universe are also possessed by general symbols 
endowed with the same non-commutative properties. 

We shall therefore develop a calculus containing a number of symbols 
which do not obey the commutative law of multiplication, but obey the 
other elementary laws of algebra. The following definitions are adopted: 

A symbol which commutes with every symbol in the calculus will be 
called etn algebraic number. 

The number 1 is defined to be a symbol which satisfies 

l.E = E.l = E, 

where E is any symbol in the calculus. From the definition of 1 the defini- 
tions of other algebraic numbers follow in the usual way. In particular i is 
defined to be a symbol satisfying 

iE = Ei, UE=-E. 

The underlying idea is that a symbol has no properties except such as 
are manifested by it in the operations of the calculus in connection with 
which it is used. Its nature lies in its behaviour; it has no intrinsic nature. 
Therefore if a symbol behaves like the number 1 in every possible operation 
of the calculus, it is the number 1. If our calculus is afterwards extended by 
the introduction of additional symbols or operations which give a further 
opportunity for discriminating behaviour, some of the symbols originally 
counted as algebraic may cease to be algebraic. We may regard "algebraic " 
as a relative characteristic depending on the range of symbols which con- 
stitutes our calculus. 

I have here deviated from the terminology in pure mathematics, where it 
is customary to give a much wider meaning to the term "algebraic". But 
I think that most readers of a physical treatise will naturally understand 
"algebra" to mean "ordinary algebra"; and therefore the distinction 
between quantities which obey the rules of ordinary algebra (including the 
commutative law of multiplication) and those which do not is most in- 
telligibly described by the adjectives "algebraic" and "non-algebraic". 



2-2] The Sixteen/old Frame 21 

2*2. Complete Orthogonal Sets. 
Let E l9 E 2 , jE 3 , JP 4 be four symbols which satisfy 

jy=-l, E^-E^ (^=1,2,3,4;^,,). (2-21) 

That is to say, the symbols are four mutually anticommuting square roots 
of 1. We shall find in 3*2 that there exist matrices which satisfy (2-21), 
so that we need have no qualms as to the legitimacy of postulating such 
symbols. 

When we are given an even number of anticommuting square roots of - 1 , 
we can always find an additional anticommuting square root, making the 
total number odd. Let iE^E^E^. (2-22) 

We have (iE 5 )* = E^ E 2 E 3 E^ E 2 E 3 ff 4 

_ JET J7J J7I J/T M JB1 J7I Tfl 

== JLJ-[ J2/J 12/2 2 3 3 4 4 ' 

since the rearrangement of order involves six jumps of a symbol over a 
different symbol, and each jump reverses the sign of the expression by (2-21). 

Hence 



so that E B 2 = - 1. We can verify similarly that E l E 6 = - E b A\ , etc. 

Thus we have five symbols satisfying (2-21). Both equations of (2-21) 
are included in the form 



)=-8^ (^=1,2,3,4,5), (2-23) 

where is the symbol defined in (1'43). 

Any product formed by repeated multiplication of E 19 E 2 , E. 3 , # 4 can be 
reduced to the form E 1 p E 2 Q E^ r E 4L 8 , since in collecting the factors the 
alteration of order can at most change the sign of the product. Also, since 
E fJ ?= 1, E^ J reduces to E^ or 1. Thus, disregarding sign, the product 
reduces to one or other of sixteen forms: 

1, E^ E^E V , E fi E v E a , E^EsEt (^ v, a = 1,2,3,4; p*v* or). 

Multiplying (2-22) by E , we have ( 2 " 24 ) 

iE l E 5 = E 1 2 E 2 E^E^= E 2 E 3 E^, 

so that by using E 5 the triple products can be reduced to double products. 
Disregarding factors 1, i, the forms (2-24) are equivalent to the sixteen 
forms i, E^ E^E V (^ v= 1,2, 3,4,5; ^v). (2-25) 

As here written they are all square roots of 1; since 

(EpErf-E^EpE^ -E^E V E V = -EJE,?--!. 

A linear function of the sixteen expressions (2-25) with algebraic coeffi- 
cients (real or complex) will be called an E-number. We see that the opera- 
tions of addition, subtraction and multiplication applied to JS?-number 
will always yield JB-numbers. In virtue of this property the sixteen expres- 



22 Wave-tensor Calculus [2-2 

sions are said to constitute a complete set. Similarly in algebra the symbols 
1 and i constitute a complete set, since the operations of addition, subtrac- 
tion and multiplication applied to complex numbers always yield complex 
numbers.! 

For reasons which will appear later the sixteenfold complete set here 
introduced is called an orthogonal set. 

The -E-iiumbers are a particular case (n = 4) of Clifford's numbers, J 
which are formed analogously from any even number n of independent 
anticommuting square roots of - 1. Since the J-numbers, or their equi- 
valent matrices, play a fundamental part in the physical theory which we 
shall develop, the theory is dependent on the choice n = 4 which we make at 
the outset. This choice will ultimately be justified in 16*8, where it is 
shown that it is imposed by the epistemologicai principles involved in the 
conception of measurement. 

2* 3 . Notation of the E -symbols . 

We shall write E^E^ (p, v= 1, 2, 3, 4, 5; /^ ). (2-31) 

For uniformity we also give the original five symbols an alternative double- 

suffix notation, viz. jr JT w /OQO\ 

\ = A)/* = ~ A*o ( 2 ' 32 ) 

Then the sixteen expressions (2-25) which constitute the complete set 
become ^ ^ ()Lt)1 , = o, 1,2,3,4,5; p*v). (2-33) 



By (2-31) and (2-32) we have in all cases E^ v = E vfl . In making up the 
complete set of sixteen symbols it is arbitrary whether we employ E^ v or 
E V p . It would, of course, be redundant to include both. 

By using (2*21) and (2-22) we find the following general rules of multi- 
plication: ^^=-1, (2-341) 

*V^a= -E^ v = E va , (2-342) 

E^E^ E^E^iE^ (2-343) 

where /*, v, a, T, A, p is any even permutation of 0, 1, 2, 3, 4, 5. For an odd 
permutation E^E^^ -iE^. 

The summation convention is not used in the above formulae. Unless 
otherwise stated we shall limit the summation convention to the row-aiid- 
column suffixes of matrices and wave vectors. In later developments the 
symbols E^ v will be identified with matrices; they will then have the form 

t Some writers use the term "complete set" for the group of linear expressions (in this 
case the j-numbers). The expressions in (2-25) would then be called "generators" of the 
complete set. 

J Amer. Jvurn. Math. 1, 350 (1878). 

The summation convention is also employed when well-known formulae are quoted from 
general relativity theory. 



2-3] The SixteenfoU Frame 23 

C^/zv)a)3> where a and ]8 indicate the element in the ath column and j8th row 
of the matrix. In that case the summation convention will apply to a and ]3, 
but not to the suffixes p., v which distinguish one matrix of the set from 
another. 

By (2-342) and (2-343) the JE-syrnbols commute or anticommute according 
as they have no suffix or one suffix in common. We therefore obtain a sub-set 
of mutually anticommuting symbols by fixing one of the two suffixes and 
letting the other vary, e.g. 

^30> ^31> ^32> ^34 ^35' 

We call such a sub-set a pentad. There are six different pentads; and each 
symbol is a member of two pentads. Our original symbols E ly E 2 , E$, E 4 , K 5 
constitute the pentad with fixed suffix 0. It will be seen that, if wo start 
from any of the other pentads ami follow the same treatment, we reach the 
same complete set. 

There exist also triads, i.e. sets of three mutually anticommuting E- 
symbols, which do not form parts of pentads, viz. 

E^, E va9 E afA (n*v*a). (2-35) 

The maximum number of mutually commuting ^-symbols (excluding i) 
is three; for no two of them can have a suffix in common, and therefore three 
symbols exhaust the six possible suffixes. The three commuting symbols 

.are accordingly 

E^ , E aT , E^ (/*, v, or, T, A, />, all different). (2-36) 

We call such a set an anti-triad. Adding to it the symbol i, which commutes 
with all symbols, wo obtain an anti-tetratl. 

The two triads E^, E va , E Q ^\ E rX , 7? v E pr , (2-37) 

where p,, v, a, T, A, p are all different, are called conjugate triads. They have 
the property that every member of one triad commutes with every member 
of the other (see 3-8). 

We shall often employ an alternative single-suffix notation for the 
JSr-symbols (2-33), viz. E^ (JLC= 1, 2, ... 16). It is then understood that the 
first five symbols form a pentad, and that 

Ei* = i, (2-38) 

but the order of the others is left unspecified. A general U-number is then 

T^Z t^, (2-39) 

/x-l 

the coefficients ^ being algebraic numbers, real or complex. The individual 
terms t^E^ are called components of the ^/-number. The algebraic com- 
ponent Ji 6 JS7i6i or i 18 , will be called the quarterspur (abbreviated as qs). We 
have therefore qg T ^ ^ ^ (2-395) 

If qs T = 0, I 7 is said to be degenerate. 



24 Wave-tensor Calculus [2-4 

2*4. Linear Independence of the E -symbols. 

(a) If a complete set is multiplied through by any one of its members, we 
obtain the same set in a different order, apart from algebraic factors 1 or 
i. This follows from (2-34). 

(6) If an JS-number vanishes, every component is zero. For suppose that 
the ^-number 



. + t m S m = 0, (2-41) 

the coefficients being non-zero. Multiply through by E m \ it follows from (a) 
that we obtain an expression of the same form and with the same number 
of terms as (2-41). The last term is t m E m 2 = t m (or it m E u ). Accordingly, 
let the result be / 



/ n tv jo\ 

.-t m = 0. (2-42) 

Let E r be one of the symbols which anticommute with J57 a . Multiply (2-42) 
firstly by initial E T and secondly by final E T , and add. Then 

t OL (E T E OL + E 0i E r ) + tp(E T Ep + EpE r ) + ...^2t m E T = 0. (2-43) 
The first term vanishes, and possibly some of tho other terms; but the 
equation cannot wholly disappear since the last term does not vanish. If 
E T commutes with Ep, (E T Ep + EpE r ) = k 2iE a , where E a is another symbol 
of the set, by (2-343). Hence (2-43) reduces to an expression of the same 
form as (2-41) but with fewer terms. 

By repeating the whole process as often as required we remove all terms 
except the last; we are then left with an equation containing just one non- 
zero term which is absurd. Thus an equation of the form (2-41) is impossible 
unless all the coefficients are zero. 

This shows that the ^-symbols are not connected by any linear algebraic 
identity. In other words the set is complete but not redundant. 

2*5. Miscellaneous Properties. 

The following easily established properties of J?-symbols are collected here 
for reference: 

(a) Each symbol (except E IQ ) anticommutes with eight symbols, viz. the 
remaining members of the two pentads to which it belongs. It commutes 
with the remaining eight symbols, which include itself and E u . 

(6) Each symbol (except E 1B ) anticommutes with at least one member of 
any given tetrad. (A tetrad is formed by four members of a pentad.) For if 
the tetrad is E ol , E Q2 , E Q3 , J5 04 , the symbol E^ has one suffix in common with 
one of these unless both a and r are 5. But a and r cannot be the same. 

(c) If an JK-number commutes with E^ , every non-vanishing component 
commutes with E^. For the condition that Et v E y commutes with E^ is 

s,u^-v,)=o. 

Terms for which E V9 E^ commute disappear; terms for which E V9 E^ anti- 
commute reduce to the form 2t v E a by (2-342). No two terms reduce to 



2-6] The Sixteenfold Frame 25 

the same E a . By 2-4 (6) the coefficients of these surviving terms vanish 
separately; that is to say, t v = for those components E v which do not 
commute with E^ . 

(d) Similarly if an -E-number anticommutes with J , every non-vanishing 
component anticommutes with E^ . 

(e) If an jE7-number commutes with each member of a tetrad, it is an 
algebraic number. For by (c) its non-vanishing components commute with 
each member of a tetrad, and by (6) no ^-symbol other than A\ 6 can do this. 
The J5-number therefore reduces to E^t^, or it lB . 

(/) For any /?-number T we have 

ji=lG 

2 E ' TE = -16qsT. (2-51) 

/t=i 

Consider the component t v E v . We have 

EpE Ep = -E v , if E v , A; commute, 

= + E v , if E v , Ep onticommute. 
Hence, if v^ 16, we have by (a) 



For v= 16, ^A;^ 16 ^= 

Hence ^E^TE^ - 16# 16 16 = - I6qs T. 

(g) The coefficients ^ of an ^-number satisfy 

tp= ~qs(T^)= -qs(^y). (2-52) 

Let H=TE fJL . Each component of S corresponds to a single component of 
T by 2-4 (a). The quarterspur of 8 corresponds to the component t^ E fl of T, 
and is therefore equal to t fA E fJi .E^= t^, which proves the theorem. 
Combining (2-51) and (2*52) we obtain 

m^-l^^(TE a )^^E^TE E^^E TE^ (2-53) 
(h) If S and T are U-numbers 

(2-54) 



2*6. Reciprocals. 

Let S and T be Jff-numbers. Generally there exist two quotients T/S, 
which are the ^-numbers R, R' defined respectively by 

R8=T, SR' = T. (2-61) 

Considering the first of these equations, the vanishing of the jB-number 
RST requires that every component of it should vanish. We have there- 
fore 16 equations (linear in r^) to determine the 16 coefficients r^ of the 
J5-number R. A solution will exist unless the determinant of the coefficients 



26 Wave-tensor Calculus [2-6 

of the r^ vanishes. Since the coefficients of the r^ are furnished by 8, the 
existence or non-existence of a solution depends on 8 but not on T (assuming 



If there is no solution, i.e. if 8 fails as a divisor, 8 is said to be singular. 

In particular, taking jf=l, an JEJ-number 8 will have a reciprocal R 
(such that R8 = 1) unless it is singular. A singular J?-number has no re- 
ciprocal. 

If R8=l, SR' = 1, (2-62) 

we have R = R (8R f ) = (RS) R = R', (2-63) 

so that the same reciprocal is obtained by either definition. An jE-number 
commutes with its reciprocal. 

Let T be an JSJ-number which commutes with 8. Denoting the reciprocal 
of S by S~\ we have S -i ^ T g ^ S -i _ $-1 ^ ST ^ g-^ 

whence S~ 1 T=TS- 1 . 

So that an U-number which commutes with 8 commutes also with its 
reciprocal. We can also show that, if two TiJ-numbers commute, their reci- 
procals (if any) commute. 

If 8 is singular, the vanishing of the determinant of the coefficients makes 
it possible to obtain an infinitude of solutions of 

RS = 0. (2-64) 

Any such solution R is called a jtseudo-reciprocal of 8. A pseudo-reciprocal 
is necessarily singular. If R is a pseudo-reciprocal of 8, XR is also a pseudo- 
reciprocal of 8, X being any jG7-number ; for if RS = 0, X RS == 0. A product 
of JB'-numbers is singular if any of its factors are singular. 

It is important to notice that, when 8 and T are j&-numbers, the equation 
ST = does not imply that either $ = or T = 0. There is an alternative, 
viz. that 8 and T are singular. If, however, 

SEpT = Q (2-65) 

for every symbol E^ of the complete set, then either $ = or T = 0. For 
suppose that 8^0. Then by (2-53) 



= 0, by (2-65). 

Hence t a = 0. Since this applies to every component t a , it follows that T = 0. 

If 8 is singular and RS contains no non-algebraic terms, then 

jRS = 0. (2-66) 

For if RS were a non-vanishing algebraic quantity a, JR/a would be the 
reciprocal of 8, so that 8 could not be singular. 

Since E^.E^l, the symbols E^ are not singular. Hence if 



2-7] The Sixteen/old Frame 21 

2*7. Transformation of Complete Orthogonal Sets.f 
Let F^qE^q', (2-71) 

where 92 / = ?'?= 1 - (2-72) 

Then the F^ form a complete set having the same structure as the set of E^ . 

This is proved by showing that the relations (2-34) pass over unchanged 
from the E^ to the F^. Taking, for example, (2-342) 

F^F^qE^q' .qE^q' 

^qE^E^q' by (2-72) 
= qE va q' = F v <,. 

Here q and q' may be jE-numbers or they may involve entirely new symbols. 
We make no assumption as to their nature. 

The converse theorem is that if JE^, F^ are two complete orthogonal sets, 
arranged in corresponding order so that F^ , F v commute or aiiticomniute 
according as E^ E v commute or anticommute, there exists a transformation 
(2-71) connecting them. We shall prove this under the restriction that 

(a) The F^ are ^-numbers, or 

(6) The Fp are new symbols which commute with all the E^ . 

16 16 

Let P-aZJ^L, P'-aSlZUP, (2-73) 

l r r l r f* 

a being an algebraic number. Consider the expression F V PE V . It has 16 
terms of the form aj p / p jg \ ^ 

If E v , Ep anticommute, and therefore F v , F^ anticommute, this becomes 
-oF^J^JEf,,, which is of the form -*F Q E a by (2-342). If E v , E^ and 
F V9 Fp commute, it becomes ^.F^F V E^E V , which is of the form a (iF a ) (iE a ) or 
-u.F E a by (2-343). Thus F V PE gives the 16 terms of -P in a different 
order; hence ^ pE ^ = _ p (2 . 741) 

Similarly E V P'F V =-P'. (2-742) 

Multiplying by final E v and final F V9 respectively, these give 

F V P = PE V , E V P' = P'F V . (2-75) 

Case (a). F^ is an jB-number. 

Multiplying together the two equations in (2-75), we have 

E V P'PE V =P'F V F V P= -P'P. 
Hence, multiplying initially by E V9 



Therefore P'P commutes with every E V9 i.e. with every symbol in the 

calculus. It is therefore an algebraic number. Reserving the singular case 

t This transformation was introduced by G. Temple, Proc. Boy. Soc. A, 127, 342 (1930). 



28 Wave-tensor Calculus [2-7 

P'P = 0, for consideration in 2-8, we can choose a so as to make P'P = 1. Tt 
follows that PP' = 1; and by (2-75) 



so that P is the required transformation operator q in (2-71). 

Case (b). F^ commutes with the E^. 

Then P = P'. If P is multiplied by E^F 19 we obtain the same 16 terms in 
a different order, except that those which commute with E l (and therefore 
with FJ acquire a factor i 2 by (2-343). Thus 8 terms are reversed in sign. In 
the product P x P, each term of P occurs 16 times, 8 times with the original 
sign and 8 times with reversed sign, except that E IB F 19 occurs 16 times with 
reversed sign. Hence p 2 = a 2 ( _ 16^ 16 P 16 ) = 16a 2 . 

Taking a = J, we have PP' = P 2 = 1. Then by (2-75) 



Hence the required transformation operator is 

? = <?' = iS# /t *;. (2-76) 

2-8. The Singular Case. 

In Case (a), but not in Case (6), it may happen that PP' = and the fore- 
going method of determining q breaks down. We shall show that never- 
theless there is a transformation F^qE^q'; but instead ofq = P, we have 
^P^), where p^^^^ p^^^E^F^, (2-81) 

and E a is one of JK-symbols. The transformation previously given corre- 
sponds to a = 16; if that fails we try another value of <r, until we find one such 



Ordinarily the vanishing of PI " does not imply that either P or P' is 
zero; but in the present case we have, by (2-75), 



for every E v . Hence by (2-65) either P = or P' = 0. 

Let EM = (iE a ) Ep (iE a ) --E^E,,. (2-82) 

Since this is a transformation of the form qE^q' with qq' = 1, the JS (or) form 
a complete set. By (2-82) E^^E^ or -E^ according as E a commutes or 
anticommutes with E^\ thus the transformation simply reverses the signs 
of eight members of the set. We call the set E^ a reflection of the set E . 
Including the original set (reproduced when a = 1 6) there are sixteen different 
reflections, which correspond to the sixteen possible combinations of sign 
in an initial tetrad E l9 E 2 , J 3 , E. 

By (2-51) S a ^)=16qsJ^. (2-83) 



2-9] The Sixteen/old Frame 29 

Hence by (2-81) 

E a PMJ0 a = - IGaZ^qs^ - 16o^ 16 # lfl = 16a, (2-84) 

since qs E^ = unless p, = 1C. Hence at least one of the quantities 1* V} E has 
a non-vanishing quarterspur. Therefore at least one of the quantities P (a) 
does not vanish. 

The quarterspurs of P (o) E a and E a P' (af) are equal; for they are the quarter- 
spurs of - ocS2^ /y a > and - oEE^F^ , which are equal by (2-54). Hence, in 
securing that P^^^Q, we also secure that P' (a) ^ 0. 

Having found non-vanishing P^ a \ P r(a \ we use these instead of P 9 P', and 
repeat our previous analysis as far as (2-75), obtaining 

f w pw=pto]B, 9 E V P'M = P'WF V . (2-85) 

It will be found that E a remains passive in the middle of the expressions 
F v PWE v and does not affect the argument. Proceeding to Case (a), we find 
as before that P'^pw is algebraic. Further, it cannot vanish; for, as shown 
for P, P' at the beginning of this section, its vanishing would require that 
either P^> or P' (a) is zero. 

It may seem curious that we should be able to choose a arbitrarily in the 
transformation jF^ = P^/S^ P' (CT) . The explanation is that by changing a we 
introduce a purely algebraic factor which is absorbed in a. Evidently the 
transformation could be further generalised by substituting an arbitrary 
^/-number in place of E a . 

We notice for future reference that there is at least one reflection E^ 
which is connected with F^ by the unmodified transformation q = c 
q' = KZE^FH . For, choosing o- so that P< a) , P'< a) * 0, we have 



-aEE.EpE.Fn~ - 

2-9. Application to Relativity. 

The transformation F^qE^q' is formally the same as the transformation 
(1-463) of a mixed tensor T' = qTq'. Limiting ourselves for the present to 
Case (a), q is now a non-singular jfir -number instead of a non-singular matrix. 
We shall find in Chapter in that fourfold matrices are a special representa- 
tion of Jfi/-numbers. Thus it is appropriate to generalise the definitions of 
tensors in Chapter I by substituting symbolic JS?-numbers for matrices. 

In physical applications we shall call a "complete set of E^ a symbolic 
frame. By the above transformation we obtain a different but equivalent 
symbolic frame F^ . If the E^ are taken to be mixed wave tensors, the 
change from one symbolic frame to an equivalent frame is a tensor trans- 
formation. A change of symbolic frame is then part of a general change of 
system of description, and other quantities occurring in the description are 



30 Wave-tensor Calculus [2-9 

changed simultaneously according to the wave tensor character assigned 
to them in the group of descriptions contemplated. 

The present analytical theory is being developed to serve as a tool in 
physical investigations, and we cannot pledge ourselves always to use the 
tool in one particular way. But the primary application will be that certain 
characteristics of a physical system are described by JP-numbers which are 
invariant for changes of the system of description, and therefore correspond 
in conception to an absolute structure transcending our variable description. 
Let T = T&tpEp be one of these invariant ^-numbers. The invariance requires 
that when we transform to a new symbolic frame E^', the coefficients t are 
changed to */ so that T^E^S^'E^. (2-91) 

The arrays ^ , t^ are regarded as components of the same physical entity 
referred to two different reference frames E E'. 



It is convenient to call the transformation E^E ' a rotation of the 
symbolic frame. Rotation is here given a somewhat generalised meaning; 
the emphasis is on the fact that it is a type of change which does not involve 
any intrinsic distortion of the frame. The frame JB ' has the same intrinsic 
structure as the frame E^, namely that expressed by equations (2-34). 

Having defined rotations of the frame we can now define corresponding 
rotations of a physical system described by ^-numbers. Consider a system 
described by invariant A 7 -numbers Z^S^JS^, J7 = S^jB r /x , etc. Let the 
system undergo a change such that T-+T', U-> U f , etc., where 

T'^pEp', U' = Vu^. (2-92) 

Then the new physical system is constructed in the frame E^ according to 
the same specification as that by which the original system was constructed 
in the frame E^. In other words the system has rotated with the frame. 
Clearly the systems (T, C7, ...) and (T f , U', ...) have the same kind of 
equivalence as the frames. They are intrinsically similar, as the frames are 
intrinsically similar. 

Normally the rotation of a physical system is described by referring it to 
a fixed frame. We therefore require the components of T' in the original 
frame E^. Denoting these components by t^ (not the same t^ as in (2-91)) 
the condition is r = a ^, = a ^ (2 . 93) 

The transformation J /4 ->< / / represents a rotation of the physical system 
relative to the fixed frame E^. Since E^^qE^q'y we have by (2*93) 

Xtp'E^q&tpEJq'. (2-94) 

The nature of the transformations of t^ determined by (2-94) will be studied 
in detail in Chapter iv. 

Any change ^->^' represents some imaginable change of the physical 
system described by ^ . The peculiarity of the transformations which satisfy 



2-9] The Sixteen/old Frame 31 

(2-94) is that the new system is intrinsically similar to the old, and the 
change is therefore pictured geometrically as a rotation without distortion. 
More generally we call such a change a relativity transformation. It can bo 
detected (if at all) by observing relations to extraneous physical objects 
that do not form part of the system to which the relativity transformation 
is applied. 

The relativity of our orthogonal symbolic frames is precisely analogous 
to the relativity of Galilean frames of space and time. Space-time frames 
are all alike initially. If we speak of a frame x, y, z, t, it is impossible to define 
in an absolute way which frame, out of an infinite number of equivalent 
frames, we refer to. But when once we have selected and labelled an initial 
frame A, any other frame B can be defined relatively to it by specifying the 
space rotation and Lorentz transformation which would convert A into B. 
Similarly we cannot define in an absolute way the frame E^ which we select 
initially. But any other frame E^ can be defined relatively to E^ by speci- 
fying the transformation symbol q (which is an ^-number of the form 
Sgr^ .By) connecting E^ and E^ f . We use these symbolic frames as the basis of 
a relativity theory which (we shall find) includes, but is somewhat more 
comprehensive than, the relativity of Galilean frames of space and time. 

Attention may be called to the perfect adaptation of the mathematical 
symbolism to the physical conditions. Owing to relativity we are unable to 
define in an absolute way the physical frame initially selected, which we label 
Ep . It is therefore appropriate that we should be equally unable to define 
in an absolute way the label E^ which we affix. For the set of symbols E^ 
is only defined by its structural properties (2-34), and these apply equally 
to Ep or to any other complete orthogonal set. The complete physical 
equivalence is therefore represented by a complete mathematical equi- 
valence. We lose this perfect adaptation when we use special kinds of / /A , 
e.g. matrices. 

There is no absolute distinction between a rotation of the physical system 
and a rotation of the frame in the opposite direction; and in elementary 
theory the term "relativity rotation" is applied indifferently to rotations 
of the physical system and of the frame. But after the first results of this 
equivalence have been gleaned, there is seldom anything to be learned by 
introducing rotations of the frame. If the frame is rotated, we have to 
transform simultaneously the specification of all objects, fields, boundary 
conditions (including boundary conditions at " infinity"), normalising 
conditions, etc., concerned in the problem. On the other hand, keeping the 
frame fixed, we can introduce relativity rotations of a particular object, 
leaving the other objects concerned in the system unchanged. In order that 
it may possess independent relativity rotations, the object must be con- 
ceived as separable from the rest of the system contemplated. A separable 



32 Wave-tensor Calculus [2-9 

object will have a structure described by invariant Jff-numbers T, Z7, ..., so 
that transformations of the form (2-94) represent displacements of the 
object without intrinsic change of its structure. 

Thus in later developments we are concerned with independent relativity 
rotations of individual objects, which provide much wider scope for the 
application of relativistic principles than a rotation of their common frame.! 
For this reason, "relativity transformation" will normally mean displace- 
ment without intrinsic change of an object referred to a fixed frame, though 
it may also be applied to a rotation of the frame if occasion arises. 

In practice an object cannot be rigorously separated from its surroundings ; 
if it could be separated, it would not be accessible to observation. But that 
does not do away with the usefulness of the conception of a separable object. 
One of the greatest achievements of current quantum theory is that it 
has found a rigorous method of avoiding this dilemma. An incompletely 
separated object is represented as a probability distribution over completely 
separated states. The environment then affects, not the state, but the 
probability attached to the state. We may therefore, with all rigour, apply 
relativity transformations to the IS-numbers describing the states pro- 
vided, of course, that the states are such as can be specified by invariant 
.E-iiumbers. This last reservation is liable to be overlooked ; and ^-numbers 
(or the equivalent Dirac wave functions) have often been applied to states 
which obviously do not possess the relativistic properties which jE-numbers 
are designed to represent. 

From one point of view the assumption that there exist in nature equi- 
valent 16-fold frames, which can therefore be appropriately represented by 
equivalent sets of j&'-symbols, is a hypothesis the fundamental hypothesis 
of our theory. But actually we appeal to an epistcmological principle which 
goes deeper than that. We will call it the k< Principle of the Blank Sheet". 

Physics is concerned with the problem of distinguishing and classifying 
the distinctions of objects, states, events. Exact measurement is a process 
of determining and classifying minute distinctions. To develop a theory of 
the characteristics which can be distinguished and of the measurement of 
the distinction, we require a blank sheet to write on not a sheet already 
scribbled over with vaguely recognised distinctions. A group of intrinsically 
indistinguishable frames is chosen as the basis of a description of the uni- 
verse, in order that the theory of distinguishable or measurable phenomena 
to be erected on it may go down to the very origin of their distinction. Tn 

t The above remarks refer to the more usual problems of quantum theory in which a 
number of objects are referred to a single frame. An important Intermediate stop is the 
consideration of the relativity rotations of two objects referred to a double frame E fJL F v . 
In this case any combination of rotations of the two objects is equivalent to a transformation 
to an equivalent double frame E^F f . This (rather unusual) development is of great 
importance in the special problems treated in the present book. 



2-9] The Sixteen/old Frame 33 

practice we do distinguish frames of space-time otherwise than by their 
transformation relations to one another; we distinguish them as being at 
rest relatively to the earth, sun, etc., or as having a special orientation with 
respect to the earth's gravitational field; but we conceive frames of space- 
time as initially indistinguishable in order that these distinctions may be 
properly inserted in the development of the theory and not hidden in its 
initial assumptions. We do not assert that there exist intrinsically indis- 
tinguishable 16-fold frames of reference in the physical world; it is only for 
an ideally simplified universe that this would be true. Our principle is that 
such distinguishability of the frames as occurs must be treated as a positive 
characteristic to be represented by appropriate symbols and combined in a 
unified theory with the other distinctions studied in physics. To exhibit a 
positive characteristic, we have to imagine a frame which initially lacks it. 

Our mode of thought requires us to formulate some kind of frame or 
background for physical phenomena. Let the background be a white sheet 
to show up the phenomena, not a jazz-painted camouflage against which 
they may lie undetected. 

Case (6), in which the equivalent frames E^ , F^ consist of entirely different 
symbols, has also an important physical application. Since q is now a 
mixture of J5-symbols and J-symbols, the relation between the two frames 
is not describable by reference to the A'-frame only. If we have three such 
frames E, F, G, the transformations y Ey ,q Ei } cannot be compared, and there 
is no meaning in saying that the change from E to F is greater or less than 
the change from E to G. Equivalent frames of this kind are required when 
we deal with the properties of two similar atoms or two electrons, con- 
ceived as non-interacting. Two electrons are intrinsically similar (or equi- 
valent) but are not the same; we cannot specify different degrees or different 
kinds of not-the-sameness, as we do for equivalent space-time frames. If 
there is interaction the case is somewhat altered, and the electrons are not 
so definitely distinct; but here again the Principle of the Blank Sheet 
requires us to start with frames corresponding to non-interaction, into 
which interaction is introduced as an explicit perturbation. 



CHAPTER III 
THE RESOLUTION OF MATRICES 

3* 1 . Four -point Matrices . 

We are now going to show that fourfold matrices may be expressed as 
2?-numbers ; so that the theory developed in Chapter n has a particular 
application to matrices. 

First consider the six matrices 

a = 100 8p = Q 010 #y = 001 

1000 0001 0010 

0001 1000 0100 

0010 0100 1000 

D a =l 000 D0=l 000 JD y =l 000 

0100 0-100 0-100 

00-10 0010 00-10 

000-1 000-1 0001 

The system of nomenclature is that the suffixes a, /J, y have reference to the 
three ways of pairing four numbers, viz. 12, 34; 13, 24; 14, 23. 

We also introduce two alternative notations for the unit matrix, viz. 

S 8 = J9 S =1. 
The following results of matrix multiplication are easily verified: 

flfy = fl y , D^D^D V \ (3-111) 

8 u *=i, # a 2 =i; (3-112) 

S a D a = D a ,S a , S a ^=-Z>0AS a ; (3-113) 

with similar results obtained by permuting a, /?, y, but not S. 

The commutative properties may be summarised as follows (a, 6 = a, /?, y, 8) : 

S a S b = S b S a , D a D b = D b D a9 S a D b = (ab)D b S a , (3-12) 
where (a&) = (6a) = l if a = 8, or 6 = 8, or a = 



= 1 otherwise ( " 

The product of any number of these matrices in any order can be reduced 
to one of the sixteen forms: 

S a D b (a,6 = a,j8, y ,8). 

For we can bring all the $'s to the beginning and the >'s to the end by 
applying (3-12), and then reduce the 's to a single S and the D's to a single 
D by applying (3-112) and (3-111). If either the S factor or the D factor 
disappears, we insert the unit matrix S s or JDg to preserve homogeneity. 



3-2] The Resolution of Matrices 35 

Thus the sixteen forms constitute a complete set in the sense explained in 
2*2. If we call a linear function of them with algebraic coefficients an SD- 
number, the operations of addition, subtraction and multiplication applied 
to D-numbers will always yield /SZ)-numbers. 

3-2. Pentads. 

It follows from (3-12) that 

(8 a D b ) (8 c D d ) - (be) (ad) (S c D d ) (S a D b ). (3-21) 

Hence the condition that S a D b and 8 D d anticommute is 

(bc)(ad)=-l. (3-22) 

Let us write down the matrices S c D d which anticommute with /S a /> 8 . 
Here 6 = 8, so that (6c) = l. Hence (orf) = -1; and since a = a, we have 
d = ]8 or y. The suffix c can have any value. Hence the matrices are 

Sfy> fyDp, S ? Dp, SsDp, S a D y , ty D y , S y D Y , S^D y . (3-23) 

Selecting one of these, S 8 Dp, we find in a similar way that the following 
anticommute with it: 

S a D a , 8 a fy, S a D y , 8^, S^, S Y Dp, 8 y D yt S r D 8 . (3-24) 
Hence the following anticommute both with $ a />a and 8$ Da: 

S^Df), 8^, S y Dp, S Y D Y . (3-25) 

The first of these is the product of /S a jD s and 8$ Dp. A symbol which anti- 
commutes with two symbols necessarily commutes with their product; thus 
no further matrices can anticommute with the triad: 

S a D 8 , S B Dp, 8^. (3-26) 

It will be found that the remaining three matrices in (3-25) anticommute 
with each other, so that 



constitute a pentad of mutually anticommuting matrices. 

Dropping the superfluous 8$ and 1 D 8 , and inserting a factor i where 
necessary to make the square of the matrix equal to - 1, the pentad is 

AS af iDp, iS y D y , S a Z) y , S Y D ft . (3-27) 

These five matrices accordingly satisfy the same conditions (2-23) as a 
pentad of JS-symbols, and constitute a particular identification of E l9 J 2 , 
j 3 , ^ 4 , ES . If identified in this order they are found to satisfy (2-22). All the 
theorems of Chapter H then have an application to matrices. 

The complete set E^ can accordingly be identified with the complete set 
i (ab)* 8 a D b , the factor i being inserted when (ab) = -h 1 in order to make the 
sauare eaual to - 1. 



36 Wave-tensor Calculus [3-2 

The five other pentads can be found from the theory of JE-symbols, or 
more simply by permuting a, ]8, y in (3-27). 

A set of four anticoinmuting four-point matrices was first introduced 
into physical theory by P. A. M. Dirac in his wave equation of an electron. 
The particular matrices used by Dirac form part of one of the pentads here 
found. It was shown by J. v. Neumann that the complete set consisted 
of 16 matrices.! The complete set S a D b was first studied in this connection 
by the author. J It has been pointed out by P. du Val that a similar analysis 
had been developed in connection with the theory of Kummer's Quartic 
Surface. 

3*3. Components of Matrices. 

In 3-2 we have found a representation of the U-symbols, and hence of all 
JB-numbers, by fourfold matrices. We shall now prove the converse, viz. 
that every fourfold matrix will represent an ^-number; that is to say, any 
fourfold matrix T can be expressed in the form 

T^t^, (3-311) 

i r r 

where E l9 E 2 , E 3 , E, E 5 are the matrices (3-27). 

The meaning of (3-31 1) will be clearer if we insert the row-and-column 
suffixes a, /? of the matrices, viz. 

T^ = S^(^) aj3 . (3-312) 

Considering in succession the 16 combinations of suffixes a, j3, we have 16 
equations to determine the 16 algebraic coefficients t^ . The values of t^ are 
unique; for if there were another set of values t^, we should have by sub- 

traction 



Hence by 2-4 (6), ^-y = 0. 

As we shall presently solve these equations for t^, it is not necessary to 
stop to prove here that the condition for the existence of a solution (non- 
vanishing of the determinant of the coefficients) is satisfied. 

We call 1^ (or E^) a component of J 7 , and p a matrix suffix, as distin- 
guished from the elements T a p and the row-and-column suffixes a, /?. 

The sum of the diagonal elements of a matrix is called the spur, and will 
be denoted by {T}. 

The matrices S a D b have no diagonal elements unless a = 8. Also we see 
from the definitions of D a , JCjj, D y in 3-1 that their spurs vanish. Hence 

t Zeits.fur Physik, 48, 881 (1028). 
J Proc. Hoy. 8oc. A, 121, 524 (1928). 

The connection with the Kummer collineation group has been treated fully by 
0. Zariski, Amer. Journ. Math. 54, 466 (1932). 



3-3] The Resolution of Matrices 37 

{S a D b ] = 0, except {S 8 D B } = 4. Changing to the E^ notation, we have 

{^} = for ^=1,2,. ..15, {# 16 } = 4i. (3-32) 

Taking the spur of (3-311), we have 

T (3-33) 



by (2-395). Thus the spur of a matrix is (appropriately) four times the 
quarterspur. It is to be remembered that for a general symbolic Jr-number 
(not identified with a matrix) the diagonal sum would have no meaning. It 
is for that reason that we have introduced the quarterspur as a more general 
characteristic, not implying matrix representation. 



By (2-52) tp= -qs(^rH - H^Z 1 } ( 3 ' 34 ) 

by (3-33). We have thus an explicit formula for the components t of a 
matrix. 

Owing to the great importance of (3-34) it may be desirable to give a 
direct proof. Multiply both sides of (3-311) by E v , and take the diagonal 
sum; we have 



Now E v Ep reduces to a single symbol, whose spur vanishes by (3-32) except 
, = v. Hence , . ,. 



which is equivalent to (3-34). 

We can write (3-34) in a form which avoids the use of the symbol { }. 
Inserting row-and-column suffixes (and temporarily dropping the matrix 



Divide the symbol T into two portions each carrying a suffix, thus, 



Since the suffixes are explicitly indicated, we may rearrange the order of 
the factors, 



the suffixes being omitted after the rearrangement since they follow the 
chain rule. Thus (3-34) becomes 



(3-35) 
We call T! and T symbolic factors of T. 

In particular, if a matrix J is the outer product of two vectors 0, #*, so 
that 

J=VX*> (3-36) 

the components are . T * w / 

F -* (3-37) 



38 Wave-tensor Calculus [3-3 

The expectation value of an operator X with'respect to wave vectors 
and x* is defined to bef x = x*X0 H- x *^. (3-38) 

It follows from (3-37) that the expectation value of E^ is y^/fa. 

3*4. General Orthogonal Frames. 

The SD matrices which constitute the symbolic frame used in 3-3 are of a 
special type, called four-point matrices, since they have only four non- 
vanishing elements. By 2-7, Case (a), an equivalent frame E^' is obtained 
by the transformation jy _ ^ , ^ = ^ ^ 3 . 41 j 

where g is any non-singular JS7-number, and therefore in the present applica- 
tion any non-singular fourfold matrix. 

The matrices E^ of the new frame will not generally be four-point matrices. 
We have therefore to consider whether the results of 3-3 will apply to 
the new frame. 

The spur {E^} is invariant for the transformation (3-41). For 



yy = {E^ (3-42) 

since q'q=l. Hence the formulae (3-32) apply equally to the frame E^. 
Except in calculating the spur, no use was made of the special properties of 
SD matrices; and therefore all the results in 3-3 apply to E^'. 

In particular the formulae (3-35) and (3-37) for the components apply to 
any orthogonal frame of matrices. 

Up to the end of Chapter vi we shall (unless otherwise stated) take the E^ 
to be general fourfold matrices which satisfy the conditions for a complete 
set. We shall not specify the particular set of matrices used. This is in accord- 
ance with the relativity principle in 2-9, that there can be no absolute 
description of the reference frame initially chosen; but if other frames are 
subsequently introduced they can be defined relatively to the first frame by 
stating the components (in the first frame) of the transformation matrix q. 
The frame S a D b has served its purpose in enabling us to construct the whole 
set of equivalent frames; but it has no special significance in physics, since 
the structure of the commutation relations is common to all the frames. 
For example, we must not think of 8 a D b as being physically distinguished 
from other legitimate frames in the way that Galilean coordinates are 
distinguished from other legitimate coordinates. Its apparent distinctive- 
ness (shown in the simplicity of the matrices) is really a misfit between the 

f This is a somewhat generalised definition of expectation value. In current theory the 
term is restricted to an expectation value with respect to one wave vector #; x is then replaced 
in the formula by the complex conjugate of 0. It must not be assumed that the familiar 
properties (e.g. that the expectation value is intermediate between the greatest and least 
eigenvalues) hold for the generalised definition. 



3-4] The Resolution of Matrices 39 

physical structure and the mathematical expression of it by matrices. If we 
keep to general symbolic J?-numbers no such misfit occurs, and in that 
respect they give a closer representation of the actualities of physics than 
the matrix representation does. 

It may be asked, What do we gain by introducing matrices instead of 
general JE-numbers? Ultimately I think we gain nothing. I do not think that 
there is anything in the physical constitution of the systems to which we 
apply this calculus that is represented in the matrices and unrepresented 
in the general ^-numbers. The main justification for using a particular 
representation is that it simplifies the algebra in practical problems. Thus in 
Einstein's theory we introduce special coordinates for the discussion of the 
phenomena of the solar system, since the analysis of these phenomena would 
be intolerably difficult if we retained general coordinates throughout. We 
shall sometimes use the frame of four-point matrices in this way to establish 
results known to be invariant, which it is therefore sufficient to prove in any 
one frame of reference. On the other hand we are liable to lose valuable 
insight by premature introduction of special frames or special coordinates. 
Temple has shown that, even in so special a problem as the determination 
of the energy levels of the hydrogen atom, matrices are not required, and 
the work can be carried out with general ^-symbols; his determination 
appears to me not only more illuminating but actually much simpler as 
regards algebraic calculation than the proofs previously given in terms of 
matrices ( 9-3). In any case the use of matrix representation expressly for 
the purpose of facilitating calculation is a very different matter from its use 
in the formulation of the fundamental laws of physics. 

But, whatever the ideal course, I am here limited by the fact that I do 
not propose to reinvestigate the whole quantum theory. I must develop the 
present relativity theory up to a point at which it meets the accepted results 
of quantum theory which are soundly (if unaesthetically) established. 
These results are given in matrix representation by Dirac and others, and 
the conventional nomenclature and definitions have reference to the matrix 
representation. I must have an eye on the theory that I am steering to meet 
before I actually make contact with it; therefore it seems unwise to post- 
pone the transition to matrix representation for long. Meanwhile the 
knowledge that there is an equivalent theory in terms of general symbols 
is reassuring; for I cannot believe that anything so ugly as the multiplication 
of matrices is an essential part of the scheme of nature. 

In 2-7, Case (6), we may take the E^ to be matrices and the F to be 
general symbols, or vice versa ; then q will be a mixture of matrices and general 
symbols. Thus the use of matrix representation does not entirely cut us off 
from general symbols; the gap can be bridged by an ordinary tensor trans- 
formation. I shall be talking chiefly about matrix frames; but if you will 



40 Wave-tensor Calculus [3-4 

inscribe q on the front cover of the book and q' on the back cover then I 
am talking about general symbolic frames ! 

There are two kinds of property which at first sight seem to be expressed 
more simply in matrix calculus than in general symbolic calculus: 

(1) A matrix can or cannot be resolved into two factors. We shall call a 
factorisable matrix a pure matrix. Thus we can recognise a distinction 
between pure and impure matrices, which is not apparent in the corre- 
sponding general symbols; the " factors" of a general U-number are an 
undefined conception. But purity of a matrix is an invariant property for all 
wave tensor transformations, since the two factors (vectors) transform 
separately. There is therefore some invariant characteristic of an JE?-number 
which corresponds to the factorisability of all its matrix representations. 
This characteristic is found to be idempotency ( 5-6). Purity is expressed 
quite as easily by idempotency in symbolic calculus as by factorisability 
in matrix calculus. 

(2) In the specimen pentad (3-27) three matrices are imaginary and two 
are real. This partition persists in all pentads ( 3-5); and it is of great im- 
portance in physics, being the foundation of the distinction between space 
and time. To ascribe real or imaginary character to general symbols would 
involve something not expressible in terms of their commutability relations. 
It is to be remembered that the E^ are all square roots of - 1, whether they 
are represented by real or imaginary matrices. In the case of four-point 
matrices the imaginary matrices are symmetrical and the real matrices 
antisymmetrical (for interchange of rows and columns), and the distinction 
can be equivalently described by reference to symmetry; but the property 
of symmetry or antisymmetry is not invariant for tensor transformations. 
Matrix representation seems to afford the easiest way of expressing this 
distinction; but there would be no great difficulty in working out an alter- 
native treatment by general symbolic methods if desired. 

From Chapter vn onwards our point of view changes, and we shall gener- 
ally restrict the E^ to four-point matrices (SD matrices). That is because 
we have finished contemplating the "blank sheet" and are beginning to 
write something on it; and the property of symmetry or antisymmetry of 
the matrices of a certain frame is one of the first things that we write. 

3*5. Real and Imaginary Matrices. 

A matrix is said to be real if all its elements are real, and imaginary if all its 
elements are imaginary. If any of the elements are complex, or if some are 
real and some imaginary, the matrix is said to be complex. We shall show 
that, if complex matrices are excluded, three members of a pentad are 
imaginary and two are real. 
We first prove by a reductio ad absurdum that five imaginary matrices 



3-5] The Resolution of Matrices 41 

cannot form a pentad. Suppose then that jP lf F^F^F^ F b are imaginary 
matrices forming a pentad; this is connected by a tensor transformation 
with the known pentad (3-27) containing three imaginary matrices E l9 &>, 
E 3 and two real matrices J57 4 , JE^. By 2-7 the transformation connecting 
complete sets E^ , F^ is 

F^PE^P', P-o^^, P' = aS^, PP'-l. (3-51) 

The singular case is avoided by using an appropriate reflection of the pentad 
(3-27); for, as shown at the end of 2-8, there is at least one reflection which 
gives non-zero P and P'. 

Write P=R + iS, P' = R f + i8' 9 (3-52) 

where R, R', 8, 8' are real matrices. Then, since P'P=* 1, 

R'R-ti'S=l, R'S + S'R = U. (3-53) 

By (2-75) F l P = PE ly J^P'-P'I^. 

Hence, separating the real and imaginary parts, 



so that EI R f RE l = R'F^\ R=-R'R. 

Hence E^ R'R = R'RE^ . Similarly R' R commutes with E 2 and E. 3 . Therefore 
by 2-5 (c) it consists of components which commute with E l9 E 2 , E.^. This 
restricts it to the form R'R = a + W5 45 . (3-54) 

Since J? 45 is a real matrix, a and b are real coefficients. 

Again, separating the real and imaginary parts of F P = P/? 4 ,E^P f = P't\ , 
we have 



so that # 4 R'SEi = S'F F^ R=-S'R= R'S 

by (3-53). Therefore R8 anticommutes with # 4 . We can show similarly 
that it anticommutes with E u , jE 24 , ? 34 . This restricts it to the form 

R'S = cEt 5 =-S'R, (3-55) 

where c is real. 

By (3-54) and (3-55) 

R'P = R (R + iS) = a + bE^ + icE^ , 



Therefore 

R'R= R'PP'R = (a + 6^ 45 ) 2 + c 2 ^ 45 2 = a 2 - 6 2 - c 2 + 2a6 J fi? 45 . (3-56) 
Comparing with (3'54), we have 

a = , a 2 6 2 c 2 = a, 

so that 6 2 + c 2 = J, which is impossible since 6 and c are real. 

We can show similarly that a pentad of four real matrices F 19 F 2 , F^, F 6 



42 Wave-tensor Calculus [3-5 

and one imaginary matrix F 3 leads to a contradiction. For, denoting the real 
matrices in the standard pentad by E l9 E 2 and the imaginary matrices by 
E 3 , JK 4 , E & , we have again two matrices -F 4 , F 6 whose character (real or 
imaginary) is opposite to that of the corresponding matrices E^, E 5 , and 
the proof applies without alteration. 

No other case arises, since by (2-22) the number of imaginary matrices in 
a pentad is necessarily odd. Thus the only possible partition of matrices in 
a pentad is three imaginary and two real. 

The theorem has been generalised to matrices of m rows and columns by 
M. H. A. Newman.f If m = 2p, where p is odd, the maximum number of 
matrices in an anticommuting set is 2q+ 1; and of these q+ 1 are imaginary 
and q real. 

A case that might possibly be of physical interest is w = 16. The maximal 
anticommuting sets are then nonads with five imaginary and four real 
matrices. A nonad can be constructed as follows: The sixteen rows are 
designated by double suffixes ocj8 (a, /?=!, 2, 3, 4). Then if E^ denotes a 
4-rowed matrix correlated to the first suffix, and F^ the same matrix 
correlated to the second suffix, the outer product E^F V is a 16-rowed matrix.J 
An example of a nonad is 

iVi, iE 3l F l} iE l2 F l9 iEtF 29 iE,F 29 iE^F 29 F 39 JF 4 , F 5 . (3-57) 

It is constructed by means of a pair of conjugate triads of E matrices (see 
(3-82)). 

3-6. Determinant of an A 7 -number. 

It is useful to have before us the explicit expression for the matrix which 
represents a general JE7-number T = S ^ E^ with some standard identification 
of the matrices E^ . The following is the matrix representing T, when the 
matrices E l9 E 2y JE? 3 , E^ E^ are taken to be i8 u , iDa, iS y D y , 8^, 8 y Do as 
in (3-27). We write r u for ft u . 

r* r* 



* 6 + T 3 , 
-T 2 -T 14 , T 62 + e 23 -f 6 ~Tg, T 43 + t^ - T 16 - t 3l , 

+*5 ~ T 3J T 16~ T 35"^ T 2 ~ T 14> T 1 ~ T 42 + ^21 "" *4 > 
T 52"~'23"~^5 + T 3> T 43~^45 ~ T 15^'31> T l ~ T 42 ~ ^21 + ^4 T 16 "" T 85 "" T + T 14* 

(3-61) 

The columns correspond to the first suffix and the rows to the second suffix 
of V 

The determinant formed by (3-61) can be evaluated. It is found to be 
detr = S^2E^^ + 8S ^^ TU ^ 8S ^^^^ 6< (3 . 62) 

Here the first two terms on the right are written in single-suffix notation, 

t Journ. Land. Math. Soc. 7, 93, 272 (1932). 

J Matrices of the form S^ are treated fully in Chapter x. 



3-6] The, Resolution of Matrices 43 

and the last two terms in double-suffix notation. In the second term the 
sign is positive iiE^,E v anticommute and negative if they commute; in the 
last term the sign is positive if /A, v, a, T, X, p is an even permutation of the 
suffixes 0, 1, 2, 3, 4, 5 and negative for an odd permutation. It is understood 
that each component is written in one way only in the double-suffix nota- 
tion; e.g. 2 i may also be written as 12 , but it is not to be included a second 
time in the summation on account of the two ways of denoting it. 

The determinant of a mixed tensor is unaltered by tensor transformations. 
Thus (3*62) will be invariant when the special matrix frame used in (3*61) is 
changed to any other frame E ' 

We define the determinant of an E -number to be the function (3-62) of its 
sixteen coefficients. With this definition the determinant of an JSJ-number 
is the same as the determinant of any matrix representation of it; and 
properties of fourfold matrices which involve their determinants can be 
extended to general symbolic JB-numbers. 

It is well known that the condition that a matrix T shall be singular is 

detT^O. ' (3-63) 

Also, by a well-known theorem, for any two matrices S and T 

det(ST) = detSxdetT. (3-64) 

We see by inspection that when E^ = S a D b 

det Ep = det 8 a x det D b = 1 ; (3-65) 

and, since the determinant is invariant for tensor transformations, this 
holds for the matrices E^ of any complete orthogonal set. (The same result 
is also found directly from (3-62).) 

By (3-64) and (3-65) det (E^ T) = det T. (3-66) 

And by (3-62) det (a + bE^) = (a 2 + 6 2 ) 2 (/x ^ 16). 

Hence 

det(cos0 + ^sin0) = l (^16). (3-671) 

We generally write cos + E^ sin 6 = e** e (see 4- 1) ; hence by (3-64) 

det (Te E e ) = det T (^ 16). (3-672) 

We shall later consider transformations of the form T-+Te E n 9 n f 
T-*el B 9 *Te* E n e n, etc. By (3-672) any number of these transformations 
leaves det T invariant, if the algebraic transformation /LC= 16 is excluded. 
Further, if p, = 16 and 16 is real, the transformation does not alter the 
modulus IdetTj. A transformation which leaves |detjf| unaltered is 
called a unitary transformation. Of the 32 possible transformations e E i* 9 
(counting real and imaginary 0^ as different transformations) the only one 
which is not unitary is that given by imaginary 16 . 

These results apply to general ^-symbols as well as to matrices, the 
determinant being defined bv (3-62). 



44 Wave-tensor Calculus [3-7 

3-7. Eigensymbols and Eigenvalues . 

If X is any symbol, and is a symbol (not zero) such that 

X = oc<, (3-71) 

where a is an algebraic number, < is said to be an eigensymbol of X and a an 
eigenvalue of X . We collect here some of the most important properties of 
eigensymbols. The results are given for final eigensymbols; but there are 
in all cases corresponding theorems for initial eigensymbols defined by 
(f) X == a^. 

Iff is a polynomial function, repeated application of (3-71) gives 

/(*).*-/().*. (3-715) 

(a) If the symbol X satisfies a polynomial equation /(X) = 0, the only 
possible eigenvalues of X are roots off (a) = 0. For we have / (X ) . </> = 0, and 
therefore /(a) .< = 0. Then, since ^ is not zero, /(a) = 0. 

In particular if X 2 is algebraic and equal to ra 2 , X has only two possible 
eigenvalues m. The eigenvalues of the E^ are i. 

(b) A symbol which has an eigenvalue has no reciprocal. For if X<f> = 
and X~ 1 X = 1, we have = X" 1 (X</>) = </>. But an eigensymbol, by definition, 
is not zero. 

Hence if an ^-number or matrix T has a zero eigenvalue, it is singular, 
anddet!T = 0. 

(c) If T has an eigenvalue A, T - A has an eigenvalue 0, so that 

det(T-A) = 0. (3-72) 

Accordingly the eigenvalues A of an ^-number or matrix are the roots of 
equation (3-72), which is called the characteristic equation. For an -B-number 
or fourfold matrix the characteristic equation is of the fourth degree in A, 
and may be written 

/(A)-(A-A 1 )(A-A 2 )(A~A 3 )(A-A 4 ) = 0. (3-73) 

It is known that a matrix satisfies its own characteristic equation (Hamilton- 
Cayley theorem), so that we have 

/(r) = (r~A 1 )(r-A 2 )(T-A 3 )(T-A 4 ) = 0. (3-74) 

Equation (3-74) may be regarded as the converse of the result (a). 

The polynomial equation of lowest degree satisfied by a symbol T is 
called the minimum equation. Since every eigenvalue must be a root of the 
minimum equation (by (a)) and every root of the characteristic equation 
is an eigenvalue, the minimum equation can only differ from the character- 
istic equation if the latter has repeated roots. 

(d) To find an eigensymbol of T corresponding to one of its eigenvalues, 
say A 19 we proceed as follows. Let m (T) = be the minimum equation, and 
let sr(T)=m(T)/(J T -A 1 ). Since (I 1 -^) is a factor of m(T), g(T) is a poly- 



3-7] 



The, Resolution of Matrices 



45 



nomial in T\ it cannot vanish, because g(T) = Q would be a polynomial 
equation of degree lower than the minimum equation. Let <l> 
where x is any symbol. Then 



Hence T<j> = \<j>\ so that <f> is the required eigensymbol. 

(e) Mutually commuting matrices S, T, 17, ... have a common eigen- 
symbol, and in particular a common eigenvector. 

We form polynomials ft (5), g*(T) y g^(U) for S 9 T, U as in (d), and take 
<!> = </i(S)9*(T)<J3(U)x- Since ft (S), jr f (5T), <7 3 (#) commute, it follows as 
in (d) that is an eigensymbol of 8 and T and U. Also ft (S), ft (T), g^(U) 
are matrices, and their product is a matrix; hence if x is a vector, (f> will be 
a vector. 

(/) If X and F have a common eigensymbol <, we have (X Y - YX) (f> = 0, 
so that JC 7 7X is either singular or zero. 

(g) IfEp and E v commute, and <f> is an eigensymbol of aE^ + bE v , where 
a 2 7* 6 2 , then <f> is an eigensymbol of E^ and ^ . 

For if (aEp + &#) ^ = a^, 

we have, on multiplying by aEp bE v , 



so that < is also an eigensymbol of aE^ - 6^ . Hence is an eigensymbol of 

+ 6A;) (aB^ - 6A 1 ,), i.e. of E^ and ff, . 
(h) If J^, #, ^? mutually commute, and <f> is an eigensymbol of 



where a 2 ^ fc 2 ^ c 2 , then <^ is an eigensymbol of E^ , ^ and E a . 

By (2-36) and (2-343) JS7 a = iE fJL E v . Hence, if c' = ir, the datum is 



We obtain, on multiplication by E^, E v , 



Hence the determinant 



bEp o/E v oc Ei JS 
j. 


7 y + c / )^ = 0. 


LU 

a 


6 


c' 


a 


= 0. 


a 


-c' 


b 


a 




-c' 


a 


a 


-6 




-6 


a 


a 


c' 





This gives the possible eigenvalues a. Then, by eliminating E and E^E V 
from three of the equations, we obtain a result of the form (jE^ -f k) </> = 0. 
Hence if j ^ 0, ^ is an eigensymbol of E^ . 



46 Wave-tensor Calculus [3-7 

The singular case, when j= for every combination of three equations, 
occurs when the four minors of the fourth column of the above determinant 
all vanish. It is easily found that this condition requires that two of the 
quantities a 2 , 6 2 , -c' 2 shall be equal. This is excluded by the enunciation 

(i) If X, Y 9 Z are commuting symbols with eigenvalues x it y^, z ( 
(i=l, 2, 3, ...), the eigenvalues of any rational function f(X, Y, Z) are 
included among the quantities f(x i9 j 

We can write 



f(X, Y, Z)-f( Xi , y f , z k )={f(X, Y, Z)-f(x { , Y, Z)} 

+ {f(x t , Y,Z)-f( Xi , y } , Z)}+{f( Xi , y it )-/(* y,, z k )} 

z k ), (3-75) 



since the first bracket vanishes when X = x i , the second when Y=y it the 
third when Z=z k . Now form the product 

U{f(X,Y,Z)-f(x t ,y i ,z k )} 

i,3,k 

for all combinations of values of i, j, k. By substituting (3-75) in it, we 
express it as the sum of a number of terms containing products 

(X-x 1 Y(X-x 2 )>...(Y-y 1 )>(Y-y 2 )...(Z-z 1 )(Z-z 2 r.... 

Every term will contain a complete set of eigenvalues of at least one of the 
symbols X, Y, Z. For, if not, let there be a term which does not contain the 
factors (X x t ), (Y-y m ), (Zz tl ). But one of the factors is 



so that either (X x t ) or ( Y y m ) or (Z z n ) must appear in every term. 

The minimum equation for X is m (X) = ll t (X - xj = 0. Since every term 
contains m (X) or m ( Y) or m (Z), every term vanishes; and we have 

n {f(X, r f Z)-/(s <f y, f %)} = <>. (3-76) 

$t j * 

This is a polynomial equation satisfied by the symbol f(X, Y, Z), and its 
roots f(x i9 y i9 z k ) accordingly include all possible eigenvalues of the symbol. 
Not every root will be an eigenvalue; for example, if JC, Y, Z are fourfold 
matrices, f(X, Y, Z) will be a fourfold matrix, so that not more than 4 of the 
64 roots of (3'76) can be eigenvalues. 

Of the above results (a), (&), (d), (e), (/), (i) apply to all symbols which 
satisfy a polynomial equation. A common example of a symbol which does 
not satisfy any polynomial equation is djdx. 

t Frobenius's theorem. It holds for any number of commuting symbols; we here take 
three as a sufficient illustration. 



3-8] The Resolution of Matrices 47 

3* 8 . Paul! Matrices . 

A notation for the U-symbols based upon the conjugate triads (2-37) is 
sometimes useful.f Denote the symbols E^, E va9 E afl \ E r \, E^ 9 E pr9 which 
form conjugate triads, by 

A l9 A 29 A 3 i B l9 B 2 ,Bs. 
Then the sixteen E^ can be written as 

A a9 B 09 iA B r , i (cr,T=l,2,3). (3-81) 

Each A is the product of the other two A\ and each B is the product of 
the other two J3's. The rules of commutation are: an A anticommutes with 
an A, and a B with a B\ an A commutes with a B. 

There is one pair of conjugate triads in which all six matrices are real, viz. 

^23 > ^31 > ^12 ^45 > ^5 ^4> (3' 82) 

J5? 4 , JS7 5 being the real matrices of the pentad. The other ten matrices of the sot 
are imaginary. With this identification, the real or imaginary character of 
the matrices in (3-81) is explicitly indicated by the absence or presence of i. 
This method of constructing a complete set can be exhibited in another 
way. We apply the treatment of 2*2 to two symbols instead of four. Let 
A l9 A 2 be any two symbols which satisfy 

V- 1 ' A^^-ArAp. (3-83) 

Then, if A 3 = A 1 A 29 A 3 is an additional symbol satisfying (3-83). The 
s y mbols A 19 A, A, i (3-84) 

form a "minor complete set". Calling any linear function of them an A- 
number, the operations of addition, subtraction and multiplication applied 
to A -numbers always yield A -numbers. 

A minor complete set can be represented by twofold matrices. Let 

It-*' 0, 2 = i, ^0 -1 

-i i 10 v ' 

These satisfy C/--1, ,,-&- -W M , ( 3 ' 86 ) 

where /*, v, X are in cyclic order. Thus ^ , 2 , 3 , i is a particular representation 
of A l9 A 29 A 39 i. The matrices (3-85) are called Pauli matricea.% 

We can show, in the same way as for the E -symbols, that with this 
identification every .4 -number is represented by a twofold matrix and every 
twofold matrix can be expressed as an ^4 -number. We find also a transforma- 
tion theory for minor complete sets analogous to 2-7, viz. 



where 



f This was pointed out by G. Lemaitre. 

% Most writers employ the matrices tj u whose squares are + 1. 



48 Wave-tensor Calculus [3-8 

In Case (6), in which the ,/ are new symbols commuting with the i^, we 
have a = i, so that the transformation operator is 

P-^-K-l + W^ + kk' + W.'). (3-87) 

To reproduce the sixteen E^ we require two minor complete sets A and B. 
These may be represented by two sets of Pauli matrices ^, 0^, provided 
that the products ^O v are taken as outer products. The outer product of 
two twofold matrices is a fourfold matrix. We are thus led back to the 
ordinary representation of the ^-symbols by fourfold matrices. 

3-9. Left-handed Frames. 

The coefficients t^ of an ^-number are in general complex algebraic numbers, 
sa Y T /i + **ii For * we have a choice of two algebraic square roots of 1, 
which we shall call ij, t a . We may regard the symbols (l,i) as constituting 
an algebraic frame; we have then to distinguish two possible algebraic 
frames (1, ij, (1, i 2 ) either of which can be combined with a given symbolic 
frame E^ . 

We have hitherto used Z? 16 and i indiscriminately; but it is now desirable 
to define the structure of an orthogonal symbolic frame unambiguously by 
eliminating i in the fundamental equations. We therefore replace (2-22) 
and (2-343) by (3-91) 



where p, i/, cr, r, A, p is an even permutation of 0, 1, 2, 3, 4, 5. 

An .E-nimiber T = (r^ + iv^) E^ will involve an algebraic square root of 
- 1 denoted by E 19 which occurs in the symbolic frame, and also an algebraic 
square root of - 1 denoted by i which occurs in the coefficients. The complete 
reference frame for ^-numbers thus consists of 

(a) A symbolic frame E^ , 

(6) An algebraic frame (l,t) for the coefficients. 

jB 16 and i may or may not be the same root of 1. Absolutely it is meaningless 
to inquire whether they are the same root. But if we (arbitrarily) regard 
them as the same in one complete reference frame, we can define other 
complete reference frames in which they are opposite roots. 

A complete reference frame in which E l9 = i will be called right-handed, 
and a frame in which E u = - i will be called left-handed. 

Since E le is invariant for tensor transformations, there is no transforma- 
tion of the type F^qE^q' between right- and left-handed frames.f Their 
relation is like that of right- and left-handed systems of rectangular co- 
ordinates, which cannot be changed into one another by rotation. 

We usually treat the algebraic frame as unalterable so that a right- 
handed frame E^ and a left-handed frame F^ are distinguished by F^ = - E^ . 

t The existence of two kinds of complete sets not transformable into one another by 
relativity transformation was, I think, first recognised by 8. R. Milner. 



3-9] The Resolution of Matrices 49 

The relation of the other matrices can have various forms; we give the three 
most important. If the frames are constructed from the same tetrad 

we have by (3-91) lf 2 ' 3> 43= lf 2l 3> 4f 



(3-92) 

the other ten matrices being the same. Similarly if they have the tetrad 
E u , E 25 , E 3S , E^ in common, 

^18> ^l ^2> ^3 -^4> ^5 ""^18 "~-^l> "~^2> ~~^3 ~"^4 "" ^5 > (3'93) 

the other ten matrices being the same. Another form, obtained by giving 
alternative signs to i in (3-81) is 

F r = E r , *l=-0 if (3-94) 

where E r denotes the real matrices 2? 23 , E$ l9 B 12 , E^ E 6 , E^, and A\ the 
imaginary matrices. In this last case we change from a right- to a left- 
handed set by writing i for i in the elements of the matrices. 

We shall find later that the distinction between positive and negative 
electric charges corresponds to the distinction between right- and left- 
handed frames; so that a positive charge cannot be changed into a negative 
charge by a relativity transformation of frame. 



CHAPTER IV 

SPACE VECTORS 
4-1. Rotations. 

When an exponential contains non-algebraic symbols, it is understood to be 
denned by the exponential series. Thus 



since JSL 2 = 1. Hence 

eV=rcos0 + J^sm0. (4-11) 

In fact, so long as no opportunity for exhibiting non-commutative pro- 
perties arises, E^ is indistinguishable from i. The reciprocal of e E G is e~ E ^. 
The ordinary factorisation of an exponential e a +0 = e a .e holds only so 
long as a and ]8 commute. For example, 

e K 1 e+E S3 <t = e E 1 9 me E^ 9 e *i0+^4 96 e*i*.eM. (4-115) 

By (4-11) 

ePvflEy = (cos 6 + Ep sin 6) E v 

= E v (cos + Ep sin 0) if EH , E v commute 
= E v (cos Q Ep sin 0) if E^ , J57 y anticommute. 
Hence eP e E v = j0,eV if E^ , JS7 V commute ] 

= E v e- E 6 if JS^ , E v anticommute/ " ( ' ' 

We now consider the relativity transformations of the JS?-numbers 
describing a physical system. By (2-94) the change t^t^ of the coefficients, 
due to a rotation of the physical system relative to a fixed frame E^, is 
given by 

Let } 

which satisfies qq' = 1. Then 



J^e-*^, (4-14) 

by (4-12), where S a denotes summation of the eight terms which commute 
with E 12 , and 2^ summation of the eight terms which anticommute. As an 
example of terms anticommuting with E 12 and therefore included in S^, 
we take 1^ + 1^. We have 

a ) cos - (t l E l + t 2 E 2 ) JS? 12 sin 
+ t 2 E 2 ) cos fl - ( - ^ JS? 2 + < 2 EJ sin fl 



4-1] Space Vectors 51 

By 2-4 (6) we may equate coefficients of the same matrix on both sides of 

(4*14) Hence 

tj = ft cos 0-J 2 sinff, ^'^sintf + facosfl. (4-15) 



That is to say, the relativity transformation q = e*** rotates (^ , t 2 ) through 
an angle 0. 

Examining the other terms in S^ we find that three other pairs of com- 
ponents are rotated in the same way, viz. < 13 , t^\ t u * ^4* ^is > ^25 The remaining 
components are unchanged. 

Considering the more general relativity transformation g=e^^, and 
taking /z = 1, 2, ... 15, we obtain 15 independent rotations of this type, each 
rotating four pairs of components. We may, if we like, add the E 19 rotation, 
viz. q = e& e , q' = e~t i6 , but this does not alter T. 

Two terms which mutually rotate necessarily anticommute with each 
other as well as with the transformation matrix. We can always find a 
transformation matrix which will rotate two anticommuting terms E^ vt 
Ep a , v * z - their product E va . There is no corresponding mutual rotation of 
two commuting terms. Suppose, for example, that we try to rotate E^ and 
^23*23- Since J? 23 = iE^E lt we shall require the transformation q=e* K *& ', 
but E l and J? 23 commute with J0 45 , and therefore come under S a . Accordingly 
^ , J 23 are unchanged by the transformation. 

Pairs of components which can rotate with one another will be called 
perpendicular] pairs which cannot rotate will be called antiperpendicular. 
We have the rule: 

Matrices commute: components antiperpendicular; 
Matrices anticommute : components perpendicular. 

The term orthogonal will be understood to include antiperpendicularity as 
well as perpendicularity. 

If the components ^ are represented as coordinates in a space of 16 
dimensions, the space contains certain planes in which rotation is possible 
and certain planes in which it is forbidden. Actually the 120 coordinate 
planes consist of 60 planes of rotation and 60 forbidden planes. More 
precisely we should say that rotation in a forbidden plane is a non-relativistic 
change; although depicted graphically as a rotation, it is an intrinsic 
deformation of the physical system described by t^ . 

Rotation in a forbidden plane will be called an antiperpendicular rotation. 
It is produced by the (non-relativistic) transformation 

(4-16) 



or t 

Taking g=e* B * as before, we have instead of (4-14) 



52 Wave-tensor Calculus 

Hence pairs of terms which are not rotated by the relativity rotation are 
rotated by the antiperpendicular rotation, and vice versa. As an example of 
terms commuting with E 12 and therefore included in S a , we take 



= (t 3 E 3 + 45^45) cos + (^3^45 + ^45^3) sin 9 
= ( 3 cos + i 45 sin 6) E 3 - i ( - 3 sin + # 46 cos 

Hence t 3 ' = J 3 cos 4- i 45 sin 0, i 46 ' = 1$ sin -f i 45 cos 0. (4-17) 

The other pairs rotated by the same transformation are 

^4> *^35 ^5 ^34 ^12 > ^16' 

If alternatively we treat (4-16) as a transformation of the frame, so that 
T ' = S W> where */-, (4-18) 

the new frame S^ does not satisfy the conditions (2-23) for a complete 
orthogonal set. Its structure is intrinsically different from that of the 
standard frame E^ and the physical structure built by t^ in such a frame is 
therefore not equivalent to the structure built by the same t^ in an orthogonal 
frame. It is to be noticed that this argument that antiperpendicular rotations 
are non-relativistic changes, does not introduce the question whether (4- 16) 
is a tensor transformation. We shall see later (Chapter vn) that in some cases 
(4*16) is a rather simple tensor transformation, though it is not that of a 
mixed tensor. I think that confusion of thought has often been caused by 
failure to recognise that, although tensor calculus is an almost indispensable 
tool in relativity theory, it does not in itself imply any relativistic hypo- 
thesis. 

4- 2 . Alternative Treatment . 

The rotations of the components t^ can also be found directly from (3-35) 
or (3-37). Consider first a factorisable matrix J = 0x* = s j ft J^. By (1-461) 
and (1-462) the relativity transformation g = eHA gives 

0' = eH**M ^, x *' = x *e-k*A. 
Hence, by (3-37), 

j v ' = - lx* r vV = ~ i x *e-**A^e*VM ^. 
If JE^, E v anticommute, this becomes, by (4-12), 



where J a == JS^. Hence, by (3-37), 

3* =^ cos 9 4- j a sin 0. (4-21) 

If Ep , E v commute, the result is j v ' =j y . The components of a general matrix 
T are transformed according to the same formula, because the general 
matrix can be expressed as the sum of a number of factorisable matrices. 



4-3] Space Vectors 53 

It is tempting to combine these elementary rotations into a "general 
rotation" g = e* 2J5 A; but we have seen in (4-115) that non-commuting 
exponentials cannot be compounded in this way. This prohibition merely 
reflects the non-commutability of rotations in ordinary Euclidean geometry , 
and is not a peculiarity of the -spaee. The objection, however, does not 
apply to a combination of infinitesimal rotations. The most general in- 
finitesimal rotation, corresponding to a general infinitesimal matrix dfc), is 

g = e* d = eteV^, (4-22) 

the squares and products of dB^ being neglected, so that the question of non- 
commutation does not arise. Equivalently (4*22) may be written 

q = 1 + $d = 1 + iSB^ . (4-23) 

When the ^ are represented as coordinates in a 16-dimensional space, 
each elementary relativity rotation g = e*M/* appears as a rotation through 
an angle 0^ occurring simultaneously in four different planes. The 15 values 
of /z (excluding ft= 16) accordingly give 60 planes of rotation; and there 
remain 60 forbidden planes. It is evident that the geometry of this 16-space 
is different from that of any type of space ordinarily studied. The novel 
features are (1) the occurrence of antiperpendicular pairs of axes, and (2) 
the locking together of four rotations. We shall now consider how to repre- 
sent the components t^ in a more familiar type of space. 

4*3. Five -dimensional Euclidean Space. 

A pentad provides five mutually perpendicular components t l9 1%, t^, J 4 , J 6 . 
Any pair of these can be rotated. Moreover, the rotation matrix E 12 com- 
mutes with E 9 , 7? 4 , E 5 , so that the mutual rotation of ^ and t z leaves 3 , 4 , J 5 
unaltered. It is therefore a simple Euclidean rotation so far as these five 
components are concerned. 

If then we represent (t l9 2 > ^a> ** W as coordinates (or as components of a 
vector) in five-dimensional space, this space has the ordinary relativistic 
properties of Euclidean space, namely that simple rotation in any of the 
ten coordinate planes is a relativistic transformation. Accordingly the 
intrinsic properties of a physical system are not affected by changing its 
orientation in this space. 

Thus by limiting ourselves to a sub-space of five dimensions we encounter 
neither of the complications of the geometry of 16-space. The domain of 
(^u 2 a , J 3 , 4 , < 5 ) has the properties which we attribute to ordinary space; and 
(leaving aside the fifth dimension for the present) we may identify physical 
space-time with a continuum constructed in this way. 

This provides a linkage between wave tensors and space tensors (1*2). 
A space vector is the pentadic part of a mixed wave tensor . Ordinarily we regard 
the components of a space vector as an array (t l9 t 29 t^ 9 t 4t9 6 ), whereas in a 



54 Wave-tensor Calculus [4-3 

wave tensor they are strung together with symbolic coefficients as a linear 
expression /^ + t z E 2 -h t$ E z + 4 1? 4 + J 6 J? 5 ; but the latter mode of representing 
space vectors has long been recognised as permissible, e.g. in quaternion 
notation, so that the difference of form need not be stressed. 

This is not a hypothetical identification. In 1-2 we left the basic wave 
vector undefined, and we are therefore free to define it at this stage. We 
now define the relation of wave tensors to the ordinary space vectors of 
physics to be such as is expressed by this identification. Henceforth our 
calculus embraces both wave tensors and space tensors. 

The question remains, What is the significance of the eleven remaining 
components of the wave tensor? When the 5-vector undergoes a relativity 
rotation, e.g. in the plane t t t 2 , rotations also occur between 13 , 23 ; 14 , t^\ 
*i5 ^25 The ordinary tensor calculus provides for these locked transformations ; 
a transformation consequent on a transformation of the basal space vector is 
made automatic by assigning the appropriate space tensor character to the 
quantities concerned in it. We shall prove in 4-5 that the following is the 
required specification: 

(a) 19 2 > *3> *4> 5 is a space vector, 

(6) 12 , 13 , ... 45 is a 10-vector or antisymmetrical space tensor of the 
second rank (analogous to a 6-vector in four dimensions), 

(c) 16 is an invariant. 

The statement that (a) is a vector and (b) an antisymmetrical tensor of the 
second rank secures that the transformations of (b) are locked to those of 
(a) in such a way that the rotations of pairs of terms occur in groups of four 
as required. 

We shall call the group of space tensors (a), (6), (c) a complete space vector, 
or (if no ambiguity is likely to arise) simply a space vector. We have therefore 
the simple relation 

Mixed wave tensor = Complete space vector. (4-31) 

In dealing with space vectors we recognise only ten relativity rotations, 
viz. the ten rotations of five-dimensional space. The wave tensor had 15 
relativity rotations (or 16, if we count the algebraic rotation). The five 
extra rotations intermingle the two space tensors (a) and (6); for example, 
the E 2 rotation rotates the component ^ of (a) with the component t l2 of (b). 
Ordinarily (a) and (6) will be recognised in physics under different names, e.g. 
velocity and spin, and a transformation which does not keep them distinct 
could only be pictured at the cost of abandoning the usual representation 
in space and time. We rather miss the point of the interpretation of mixed 
wave tensors as space vectors, if we go on to attribute to the space vectors 
transformation properties outside those which the name ordinarily suggests. 
There is no advantage in introducing space vectors so generalised that we 



4-4] Space Vectors 55 

can no longer " see " them in space; for if we are not going to use the space 
picture, we have to fall back on their analytical description as mixed wave 
tensors, and it is more appropriate to refer to them by that name. We may 
therefore agree that the name space vector implies that only the ordinary 
relativity rotations of the space are under consideration; and that if it is 
desired to include the transformations which transcend space-time repre- 
sentation the proper designation is wave tensor. 

This question of nomenclature arises because in 7-6 we shall define an 
important entity which behaves as a mixed wave tensor for the ten rotations 
of 5-space, but not for the other rotations . Thus it may be properly described 
as a space vector, but not as a mixed wave tensor. Except in this connection 
it is not necessary to emphasise the distinction, and I shall generally use 
the two names as equivalent. 

4*4. Four -dimensional Spherical Space. 

We have now to consider why the actual world is four-dimensional, although 
the analytical theory (which we have reason to think is appropriate to the 
physical world) seems to provide for five dimensions. The answer is not 
difficult to find. Space-time is four-dimensional, but it is not flat (Euclidean) ; 
and if we restrict ourselves to Euclidean geometry, we require at least one 
more dimension to represent its curvature. 

Our actual space-time with its irregular curvature, due to local gravi- 
tational fields, requires a ten-dimensional Euclidean space for its repre- 
sentation; but we do not arrive at this complexity until we provide for a 
more varied content of the universe than can be represented by a simple 
wave tensor. Any problem which is mathematically simple is necessarily 
highly idealised; and the simple wave tensor with which we commence our 
study can carry us only a little way towards actuality. We have found 
that it is capable of rotation, without intrinsic change, in ail planes in 
five dimensions. This leads immediately to the conception of an entity 
distributed over a continuum which is a hypersphere (four-dimensional 
manifold) in a Euclidean space of five dimensions. We must start with this 
simplified space-time, and watch the more complex characteristics of actual 
space-time grow out of it as the theory develops. 

We shall first show that it is the hypersphere (and not the five-dimen- 
sional space in which it is represented) which constitutes the physical 
continuum. This is because the hypersphere is a locus of equivalent 
points, whereas the points in five-dimensions are not generally "equivalent " 
points. 

Let the coordinates of a point P be (^ , t 2 , J 3 , $ 4 , 5 ). Any of the ten rotations 
in five dimensions will (if it displaces P) carry P to a new position P 1 on 
the same hvneranhftTe ahniit tha oricrin. Than t.hft -nninfja P P' nr 



56 Wave-tensor Calculus [4-4 

structed according to the same specification in different but equivalent 
frames, and are therefore equivalent points (2-9). None of the relativity 
transformations provides any connection between points which are not on the 
same hypersphere. If we consider a point P" such that PP" is normal to the 
hypersphere, P and P" are not equivalent points. The transformation 
(0, 0, 0, 0, J 6 ) -> (0, 0, 0, 0, t$) is not a relativity transformation; it is in fact 
an antiperpendicular rotation with matrix A 7 16 . 

Thus displacement normal to the hypersphere involves, either a different 
construction, or the same construction in a frame which differs intrinsically 
from the original frame. Taking the latter view, the difference between the 
two frames is evidently a difference of scale-constant. Our five-dimensional 
picture accordingly represents change of scale-constant or gauge by dis- 
placement in a fifth coordinate, normal to the four ordinary coordinates 
used to represent position. A similar graphical representation of change of 
gauge is used in "Projective Relativity". 

It is the essence of the elementary conception of space and time that all 
points of it are equivalent. A particle is not intrinsically different because it 
is at a different point of space or because it is contemplated at a different 
time. It is true that in later developments regions of space and time are 
distinguished from one another by varying curvature; so that the space-time 
background is no longer a "blank sheet". But we have to trace the origin 
of these distinctions, and must begin with the blank sheet. This is provided 
by the hyperspherical continuum of equivalent points; the curvature is 
uniform, and every part of the continuum is precisely similar to every other 
part, as the conception of " equivalence" implies. 

Thus although the transformation theory introduces the conception of 
five dimensions, it is clear from the start that there is an absolute distinction 
between the displacements lying in the four-dimensional hypersphere and 
displacements in the fifth dimension normal to the hypersphere. Whereas 
the former changes of a system are of the type which we conceive as dis- 
placement in physical space and time, the latter are only " displacements " 
in the sense in which we regard any change of quality of a system (scale, 
temperature, entropy, etc.) as a displacement of the representative point 
in a graph of that quality. 

Let us now confine attention to a region of the hypersphere small enough 
to be treated as flat, and let t b be the coordinate normal to this region, the 
coordinates in space-time being (t l9 t 2 ,t 3 , J 4 ). We now exclude the rotations 
^15 > ^25 ^35 > ^45 > because they would carry us away from the small region 
considered and mix the scale-coordinate 5 with the coordinates recognised 
in our ordinary outlook. There remain the relativity transformations 
corresponding to the matrices 



4-5] Space Vectors 57 

These constitute the transformations admitted in special relativity theory, 
viz. three rotations in space and three Lorentz transformations. We now 
consider how the general mixed wave tensor will appear from this outlook 
the ordinary outlook on which current nomenclature is chiefly based. 
The mixed wave tensor is found to break up into 

(a) a vector t l9 t 2 , J 3 , ? 4 , ^ 

(6) another (adjoint) vector 15 , J 25 , J 35 , J 4B , | (4-42) 

(c) a 6-vector * 23 , * 

(d) two invariants 

The proof that (a) and (6) undergo the same transformations for any of the 
rotations (4*41), and are therefore "tensors of the same kind", is obvious. 
The proof that (c) is a 6-vector is given in 4-5. 

Thus in four dimensions the complete space vector is composed of two 
vectors, a 6-vector, and two invariants. These remain distinct in any of the 
six internal rotations of the 4-space which transform the 4-space into 
itself. Naturally the use of the four-dimensional picture presupposes that 
we are not interested in the transformations which cannot be represented 
in the 4-space; if we have occasion to refer to them we must revert to one 
of the other modes of description. 

If we are right in our belief that all physical phenomena are analysable 
into ultimate elements described by wave vectors and their combinations, 
it follows that an ordinary space vector cannot occur alone; it is part of a 
group of allied space tensors (4-42). The other members of the group may, 
of course, have zero value, but that is not the same as being non-existent. 
This is one of the ways in which the new outlook enriches the earlier theory. 

4-5. Proof that (J 23 , / 31 , / 12 , 14 , J 24 , J 34 ) is a 6-vector. 

If Xp , y^ (p = 1, 2, 3, 4) are ordinary space vectors X, Y, their vector product 
consists of six quantities (x^yv ^y^). Then any set of six quantities which 
transform according to the same law as (x^ x^y^) is called a 6-vector. 
In matrix form the vectors are 
X = E 



Their matrix product X Y is found by direct multiplication (using E^ = - 1 , 
EpE,- -tf,ig to be XY = - 

where (xy) is the scalar product, and 



to be - (4-51) 



). (4-52) 

Apply a transformation q representing a rotation or Lorentz transforma- 
tion in four dimensions. Then (xy) is invariant, and 



58 Wave-tensor Calculus 

Hence by (4-51) S' = X'Y' + (xy)=qXYq' + (xy) 



(4-53) 

Let # -# 2 3*23 + ^31 '31+- +^34*34^ ( 4 ' 54 ) 

Then, since the constituents (a), (6), (c), (d) of the complete space vector T 
transform separately, we have ^, _ jj , (4-55) 

By (4-53) and (4-55) U and S obey the same transformation law for rotations 
and Lorentz transformations in four dimensions. It follows that their 
components t^ v and (x^y v - x v yj obey the same transformation law. Hence 

Vv 0*' v== lj 2 > 3 > 4 ) is a 6 - vector - 

We can show similarly that t^ v (ft, v = 1, 2, 3, 4, 5) is a 10-vector in 5-space. 

It will be seen that the matrix product XT, used in the present calculus, 
is not the same as the ordinary vector product X x Y, but is X x Y - (xy). 
When two vectors are perpendicular their scalar product (xy) vanishes, and 
the matrix product is then the vector product. 

The product of two complete space vectors is a complete space vector. 
For the product W UV transforms according to the law 

W = U'V'=: qUq'qVq' = qUVq' = qWq'. (4-56) 

But, as we have seen above, the matrix product here employed is a type of 
combination which has no exact counterpart in the ordinary theory of 
vectors. 

4-6. Volume Elements. 

The expressions E^dx^ E 2 dx 2 , E^dx 39 E^dx^ are the wave tensor notation 
for four space vectors (displacements) along the four rectangular axes in 
space-time. Their product constitutes a volume element dV I234k . Writing 
dr = d 



by (2-22). Since the jEJ's anticommute, dV^^ is antisymmetrical in its four 
suffixes as in the ordinary tensor calculus. f The factor iE 5 corresponds to 
V - g. We are restricted to rectangular coordinates (one of them time-like), 
and therefore g is always 1; but we deviate from the usual theory which 
assumes that the radical indicates the algebraic square root. We take 
instead a matrix square root iE 5 . 

Consequently, notwithstanding that we use rectangular coordinates, the 
distinction between vectors and vector densities does not wholly disappear. 
In the ordinary_calculus we have, corresponding to a vector A , a vector 
density A^V g. Here V g = iE b , so that corresponding to a vector 



we have a vector density 
i(E 
t Mathematical Theory of Relativity, 49. 



4-6] Space Vectors 59 

Of the two adjoint vectors in (4-42) one represents a vector and the other a 
vector density. We shall show later ( 5-8) how it is possible to decide which 
is which. 

More generally we define the volume element contained by four space 
vectors ePa?^, &*x^ A^x^ d*Xp, which do not coincide with the axes, to be 
the permanent 



^ 



1234 = 



E 2 d l x 2 , 



(4-62) 



E 2 d 2 x 29 
E 2 d*x 2 , 
E 2 d*x 29 

A permanent is expanded like a determinant except that all the terms are 
given positive sign. The factors are arranged in the order of the rows from 
which they are taken; since the rows correspond to the four vectors, this is 
the natural order of the factors when the vectors are multiplied in a given 
order. The anticommutation of the four J5?'s provides an alternation of sign 
which converts the permanent into a determinant; and (4-62) reduces to 

dF 1234 = E l E 2 E 3 E^ dot (dXp) = iE & det (dxp). 

In our later developments three-dimensional vector densities are more 
important than the foregoing four-dimensional densities. The volume- 
element of three-dimensional space, contained by vectors E i dx 1> E 2 dx 29 
E 3 dx 3 along the coordinate axes, is 

^E 2 E^dw = lEudw, (4-63) 



where dw = dx^dx^dx^ . 
If T is any space vector 

TdW l2 z = iTEudw = Sdw, (4-64) 

where S = iTE^. (4-65) 



Then 8 is the three-dimensional vector density which corresponds to the 
vector T. We shall later meet with S in another connection, in which it is 
called the strain vector associated with the space vector T. 

By (4-65) we can find the componehts s^ of 8 in terms of the components 
tp of jP. The following is the complete scheme of relation: 



The components of S associated with real matrices are given in the first line 
and those with imaginary matrices in the second line. It will be seen that 
the latter correspond to the 10-vector part of T. 



60 Wave-tensor Calculus [4-7 

4-7. Wave Functions. 

Consider a wave vector which is a function of a complete space vector T. 
That is to say, to every one of a continuous set of space vectors T there 
corresponds a wave vector which we denote by ^ =/ ( T), or by/ (^ , J 2 , . . . J 16 ) , 
where t l9 t 2 , ... are the components of T. 

It is usual to distinguish certain of the variables t^ as coordinates, the others 
being parameters of the wave function /. Denoting the coordinates by # , 
and the parameters collectively by a, the notation is changed to $ a =/ a (x^). 
Thus the original wave function is treated as a continuous set of wave 
functions \f> a distinguished by parameters a, each of which covers the domain 
of coordinates # . 

In most practical applications the domain of coordinates is space-time; 
and a wave function has the form ^=/(# l5 tf 2 > a? 3 , o? 4 ), or f(X), where 
Z = ^ 1 x 1 + E^x^ + E^x 3 + B^XI. The function then specifies a wave vector 
field in space-time. 

The function /is necessarily double-valued. To show this, let 



I.e a wave- vector field, r, 0, </> being polar coordinates. If we apply a tensor 
transformation jsae**" 06 to the frame, we refer the vector field to a new 
coordinate system. By (4-15) the effect of the transformation on the co- 
ordinate system is that the point whose azimuth was <f> in the old system 
has an azimuth ^ -f a in the new system. The new coordinates are therefore 

r' = r, 0' = 0, ^' = < + oc, # 4 ' = z 4 . (4-72) 

The transformation of ^ is $' = e^w a ^. (4*73) 

Let the vector field in the new coordinate system be 



,s 4 ). (4-74) 

Then, by (4-73), I" (r, 0, <f> + a, xj = e^JP 1 (r, 0, * 4 ). (4-75) 

Now let a = 277. Since e*" 11 " = 1, 

P' (r, 0, ^ + 27T, Xt)=-P (r, 0, ^ ajj, (4-76) 

or in rectangular coordinates 

/' (*i, x*, ZB, ^ 4 )= -70*1, ^2, ^3, aj- ( 4 ' 7 7) 

But, since the axes after rotation through 2?r are the same as they were 
originally, */>=f(x l9 x 2 ,x 3 , xj and 0=/ ; (x l9 # 2 , # 3 , a; 4 ) are both expressions 
for the wave-vector field in the original coordinate system. Thus iff is a 
double-valued function f of rectangular coordinates. 

Since this result is of vital importance, we must try to remove any 
doubt as to its meaning. Let I! be a particular frame of rectangular 
coordinates and time, and let P be a particular point in that frame. 



4-7] Space Vectors 61 

Suppose first that iff is single- valued; so that, referred to the frame and 
at the point P, it has the value ift and no other; in particular 0^ ^r . 
When we vary the frame or the point considered, iff changes; but if after 
such variation we return to the frame S and the point JP, ifr must return 
to iff Q . For example, if we rotate the frame in any plane in space, after 
a rotation 2ir we come back to the frame S again, so that iff is again . 
But if is a wave vector, the law of transformation of wave vectors 
requires that, when the space-time axes are rotated through 2?r, iff shall 
change continuously from ift a to iff Q . Therefore the single-valued if/ which 
we have been considering cannot satisfy the transformation law of a 
wave vector. 

The double-valuedness of / becomes of practical importance when (as 
usual) the vector field is defined over a region which includes all azimuths <f>. 
For then, on following the point P round a circuit back to the initial 
azimuth, we may find ourselves on the opposite branch of / from that on 
which we started. The return may be either to the opposite branch or the 
same branch ; either type of connection satisfies the condition of continuity 
of ift in the region over which it is defined. 

We note accordingly (for future reference) that a symbol iff^ (a = 1, 2, 3, 4), 
defined to be a single-valued function of rectangular coordinates over a 
domain which includes or encircles the origin, cannot be a wave vector. By 
"encircles the origin" we mean that a circle having the origin as centre 
can be drawn in it. 



CHAPTER V 
THE SIMPLE WAVE EQUATION 

5* 1 . Invariant Equations . 

In order that the laws of physics may be independent of the choice of frame 
(among the equivalent orthogonal frames) they must be expressible as 
tensor equations. In wave-tensor calculus, the simplest non-trivial tensor 
equation is of the form ^ i __ Q / 5.11 \ 

where H is a mixed wave tensor and $ a covariant wave vector. Then (5-11) 
is a vector equation Hj*iltp = Q equivalent to four algebraic equations. 

It may be anticipated that the simplest, and presumably the most 
fundamental, laws of physics will have this form. Alternatively, regarding 
(5-11) as a definition rather than a law, it is an appropriate means of intro- 
ducing a wave vector and relating it to the ordinary space vectors of 
physics. For we have seen ( 4-4) that a mixed wave tensor H is constituted 
of space tensors which will presumably be recognised as such in our practical 
observations; but there is no such "projection" of ^ into a space-time 
representation, and its connection with the ordinary space tensors of physics 
can only be expressed indirectly by an equation such as (5-11). 

It is appropriate to introduce simultaneously a contravariant wave 
vector x*, satisfying x *ff = 0. (5-12) 

We may expect that $ and #* will occur symmetrically in physical theory. 

Thus far our argument has been that if ever nature condescends to 
simplicity, equations of the types (5-11) and (5-12) will figure in her scheme. 

Before the birth of wave mechanics the systematised part of physics was 
wholly described by space vectors and tensors. Wave mechanics introduced 
a new kind of entity 0. It was introduced in the way here proposed by a 
"wave equation" in which the coefficients were the ordinary space-tensor 
quantities of physics. The original ifi of Schrodinger did not satisfy the 
relativity requirements of atomic physics; but in 1928 Dirac introduced a 
/r with four components, which satisfied an equation invariant for the six 
relativity rotations of space-time, although the invariance was not of a kind 
contemplated in the usual tensor calculus. Our anticipated fundamental 
equation turns out to be a form of Dirac 's equation.f 

t The form given by Dirac (Quantum Mechanics, 2nd ed., p. 255, equations (9) and (10)) 
is the equivalent strain vector equation, which we shall obtain in (7'73). Dirac further 
postulates, as a "reality condition", that the two wave vectors are conjugate complex 
quantities. Our reality conditions are determined directly from relativity principles in 
Chapter vi, and do not impose this restriction. 



5-2] The Simple Wave Equation 63 

The mixed wave tensor H used in Dirac's equation is limited to five com- 
ponents. One component has dropped out through special choice of axes, 
permissible when as usual the region contemplated is small enough to be 
treated as flat. Apart from this the truncation is significant, because the 
general mixed wave tensor T cannot be reduced to Dirac's special form // 
by any choice of axes. We shall account for this limitation in 5-4. 

We shall derive Dirac's equation according to the principles which we are 
developing in 5-4; but we shall first examine its elementary properties 
looking ahead to see the theory which we are about to meet. 

Dirac's equations are Hi/t = Q, x*JF/ = 0, (6-13) 

where H = E l p 1 + E 2 p 2 + E 3 j) 3 + ff 4 P 4 ~ m (5- 14) 

and E l9 E 2 , J5 3 , E constitute a tetrad. We call H the Jiamiltonian.^ Since 
// is required to be a mixed wave tensor, its components form space tensors 
in accordance with the specification in (4-42), namely (p l9 p 29 ^ 3 , 2> 4 ) is a 
space vector and the quarterspur m is an invariant. 

Conversely if, following Dirac, we construct a hamiltonian // out of a 
"momentum vector " (p l , p 2 , p* , p*) and an invariant mass m by the formula 
(5-14), H will be a mixed wave tensor; and therefore the wave equations 
/fy = 0, x*// = will be tensor equations which continue to be satisfied 
when any of the six relativity transformations of space-time are applied. 

Thus the invariance of Dirac's equation for relativity transformations, 
which was a novel kind of invariance from the point of view of ordinary 
tensor calculus, is an elementary consequence of wave-tensor calculus. 

5-2. Properties of Dirac's Equation. 

The equation H^ = shows that H has an eigenvalue 0, and hence that it is 
singular ( 3-7 (6)). It has a pseudo-reciprocal 

H'^Eipi + Etpz + Ezpz + Etpt + m. (5-21) 

For, multiplying by (5-14), 

##'= -Pi 2 -P2 2 -P* 2 -P* 2 ~ 2 . ( 5 ' 22 ) 

The product contains no non-algebraic terms, and therefore vanishes by 
(2-66). 

For a physically real momentum vector (p l , p 2 , p 3 , jp 4 ), p l , p 2 , Pz are real 
and PI is imaginary. Let ft = ^ ? (6 . 23) 

f In classical theory the hamiltonian is the expression for the energy ( tp 4 ) in terms of 
the momenta p l9 p %9 p 9 and coordinates. If, following the usual relativistio view, time is 
treated on the same footing with the other coordinates, the hamiltonian is correspondingly 
defined as the expression for the proper energy m in terms of p l9 p 29 p 99 p 4 and the co- 
ordinates. This would make the hamiltonian strictly T+m, but we shall use the term without 
regard to an additive constant. 



64 Wave-tensor Calculus [5-2 

so that ft is the real time component, i.e. the energy or mass. The vanishing 
of (5-22) gives rf- A -- A -- A -- A '. (5-24) 

This identifies w with the proper energy (or proper mass) corresponding 
to the momentum vector. Thus m must be real. 



By (5-24) p Q = (mt+pf+pf+pf)*. (5-251) 

Usually p l9 2h> Pz are small compared with m, and we then have the 
"classical " approximation for the energy 

P Q = m + (Pi* + ft 2 +ft 2 )/2w. (5-252) 

The general solution of the wave equations is found as follows. Let ^, o>* 
be arbitrary four-valued quantities. Since HH' = H'H = 0, we have 



Thus ^ = #'<, x* = w*77' (5-26) 

are solutions of (5-13). Inserting row-and-column suffixes, these become 



Since ^ lf <^ 2 , <^ 3 , < 4 are arbitrary coefficients, the general value of a is a 
linear combination of four elementary solutions # al ', /7 a2 ', ^T a3 ', // a4 ', i.e. the 
four rows of the matrix //'. Similarly the four columns of H' are the elemen- 
tary solutions for # Since H' is singular, its determinant vanishes, and 
therefore only three of the rows and three of the columns are linearly 
independent. There are therefore not more than three independent solutions 
in each case. 

For example, use the matrix representation (3-61). The special form (5-21) 
for H' gives 

-i#'= pz-im Pl +p Q ps 

Pi-Po -ft -fin -#3 

-ft ft-iw PI-PQ 

ft Pi+p Q -p 2 -im. (5-28) 

Any row gives a solution for ^ and any column a solution for x*- It is not 
necessary to choose the solutions in a corresponding way.f For example 
we might take 

^ = (ft~im,p 1 +ft, 0,ft), x* = (0, -ft,ft-im,Pi+ft) 
as the pair of wave vectors constituting a solution of the wave equation. 
t Our treatment here differs fundamentally from that of Dirac. See footnote, p. 62; 



5-3] The Simple Wave Equation 65 

5-3. The Stream Vector. 

Let iff, x* be solutions of the equations H$ = 0, #*/? = 0; and let 

X * = J = S^. (5-31) 

Multiply the wave equations by initial x*^ia an( l final ^12 0> respectively; 
we obtain 



Hence, subtracting, j 

so that, by (3-37), j*Pi-PiJ* = 0- (5-321) 

Again, multiplying by initial x*^i5 an d final # 15 0, and subtracting, 



(5-322) 
so that ^'5 = 0. Also multiplying by initial x*^i an d final E^, and adding, 



(5-323) 
The results (5-321), (5-322), (5-323) give 

3\ _ fa _ Jz _ Ji _3& _ V IB / K oo\ 

X lO'uOJ 

Pi P* Ps P* m ' 

The wave equation can therefore be written in the equivalent form 

(EJi + EJ 2 + #J 3 + ^7 4 J 4 - JP M J 16 ) ^r = 0. (5-34) 

We call (Ji,J2,j* 9 Ji) the stream vector. The whole set of sixteen j^ is the 
complete stream vector. We have here proved that the stream vector is equal 
to the momentum vector except for a numerical factor. Multiplying the 
complete stream vector by the same factor we obtain the complete momentum 
vector. 

We regard this correspondence of the stream vector and momentum vector 
as a coalescence which occurs in the peculiarly simple system here studied. 
In more general physical systems they are not so closely connected. Accord- 
ing to the definition of the momentum vector usually adopted in quantum 
theory and reached later in this book, the components p^ are not necessarily 
algebraic quantities; they may be matrices or general symbols. On the 
other hand the components^ of the stream vector are necessarily algebraic 
quantities. 

The result (5-34) leads us to a new view of the wave equation. Consider 
a physical system described by a pure (i.e. factorisable) wave tensor /, or 
by the equivalent set of space tensors. We are not given the whole set of 
space tensors, but only one of the space vectors (j^ , j 2 , ^ 3 , jj together with 
the two invariants ^5 ( = 0) and j lB . We cannot therefore determine definitely 



66 Wave-tensor Calculus [5-3 

the factors of J; but our data are sufficient to limit them to certain possi- 
bilities, viz. they must be solutions of (5-34) and of the corresponding 
equation for #*. 

The wave equation is therefore an equation for determining the possible 
factors of a wave tensor, which is only partly known. 

At this point we must try to make clear a difference in our attitude towards 
wave mechanics from that which appears to be usual among quantum 
physicists. It will probably be agreed that wave mechanics is a method of 
analysis, not a theory of phenomena. The waves have no objective exist- 
ence; we invent them as required in solving our problems. In the present 
treatment we have found that any space vector can be expressed as a wave 
tensor. Ordinarily it is not a pure wave tensor; but it can be represented as 
a sum of pure wave tensors, which are then resolved into their wave-vector 
factors. If the space vector is a function of the coordinates, the wave-vector 
factors become "wave functions". In this way wave functions appear in 
connection with any characteristic of a system which is described by space 
vectors. It is therefore ambiguous to speak of the wave functions of a 
system; we should rather speak of the wave functions associated with some 
specified tensor of the system. Reference to the wave function or the wave 
equation of a system leaves us in the same state of conjecture as if reference 
were made to ' ' the tensor of the hydrogen atom " or " the equation of the sun ' ' . 

I am not cavilling at expressions, whose meaning is doubtless made plain 
either by the context or by custom. My point is that when wave analysis is 
our standard procedure when the ordinary tensor calculus is replaced by 
wave-tensor calculus we shall introduce new wave functions as casually 
as we introduce new tensors. The domain of physics treated in this book is 
for the most part different from that which has occupied the attention of 
writers on pure quantum theory. Sometimes our wave functions will 
coincide with theirs; sometimes they will differ. We find it well to maintain 
a certain amount of contact in order to utilise well-known results; but in 
principle we do not bind ourselves to use the wave functions that the 
quantum physicists have discussed. Remembering that the introduction of 
wave functions is merely a factorisation, we must obviously retain freedom 
to employ factorisation whenever it is useful. 

The reader must therefore be prepared to find here a greater elasticity in 
the definition and use of wave functions than he has been accustomed to. 

5*4. The Wave Equation as an Identity. 

If we represent E^E^E^E^ E 5 by the special pentad of matrices (3-27), 
it is not difficult to prove by straightforward verification that 



5-4] The Simple Wave Equation 67 

where is any four-valued quantity. Here, as usual, E^ is the four-valued 
quantity formed by chain multiplication, and (E^)^ is one of its four 
components, i.e. (E^) QL ^(E^) QL ^. 

Any other pentad E^ is obtained by a transformation E^ = qE^q- 1 . Let 
$' =g^; then E'^qE, so that 



Hence, multiplying (5-41) by q aoi q T p, we have 



it** 5 

S <JV*%(Wr- (# 16 f U# 16 0')r==0. (6-42) 



We can choose ty arbitrarily; because (since q is not singular) the corre- 
sponding ^ = y 1 ^' can be employed in (5-41). 

Thus the identity (5'41), verified for a particular pentad, is true for any 
pentad of matrices whatsoever. 

Let x* be another arbitrary four- valued quantity. Multiply (5-41) by 
initial x (inner multiplication). We have by (3'37) 



where 0* J %j E . The result is therefore 



or (ttiJi + E 2 h + # 3 J 3 + Et Jt + EsJ 5 - JP 16 j le ) = 0. (5-43) 

We can show similarly that 

. (5-44) 



Except that the term in j 5 is included, these are the wave equations as given 
in (5-34). They are here obtained as an identity satisfied by any two wave 
vectors 0, x* and their outer product J. 

We see that Dirac was right in restricting his hamiltonian to the above 
terms, instead of employing the sixteen terms of a complete space vector. 
By omitting j & he restricts the equation to stream vectors which have zero 
component normal to space-time; otherwise his equation is a perfectly 
general one satisfied by the factors of any pure wave tensor. The hamiltonian 
H is part of the complete stream vector J, except that the sign of j 16 is 
reversed. It must not be supposed that the components of J (other than 
j 5 ) which do not appear in H are zero. 

On the other hand Dirac's postulate that i/ and x* (or rather a quantity 
<* easily derived from x*) are conjugate complex quantities would restrict 



68 Wave-tensor Calculus [5-4 

J to very special forms. The restriction has to do with certain special 
applications, and is inappropriate in general theory. 
Multiply (5-43) by initial x* and apply (3*37). We obtain 

Ji 2 +J2 2 + Js 2 + J 4 2 + J 5 2 ~ Jie 2 - 0. (5-45) 

Again, multiply (5-43) by initial x*^& an( l a Ppty (3-37). We obtain 

JJlS +J2J2S +J3JM +JJ*5 = 0. (5'46) 

These, and the corresponding equations obtained by substituting other 
pentads, are relations satisfied identically by the components of a pure 
wave tensor. There are, of course, no such relations between the components 
of a general wave tensor T which is not stated to be factorisable. 



Let P a =j- 1? (a = 0, 1, 2, 3, 4, 5; M ^a). (5-47) 

/i=0 

For fixed a, the matrices E^ form a pentad. We therefore call P a a pentadic 
part of J. There are six pentadic parts which overlap, so that 

P a P a + qsJ = J. (5-48) 

The pentad which we have been using corresponds to <x = 0, and the wave 
equation (5-43) can be written 



But since the proof holds for any pentad, we have more generally 

CP.-tfieW^O (5-491) 

or equivalently P^iff = (qs J) iff (5-492) 

for all six values of a. 

From the present standpoint the use of the wave equation is to determine 
the factors of a pure wave tensor J. It seems to be generally true that in 
physics we determine a factor iff, not because its value is of particular im- 
portance to us, but because that happens to be the most convenient way of 
ascertaining that a factor exists, i.e. that J is pure. For example, the wave 
function iff of a hydrogen atom is investigated primarily because the mere 
existence of such a function imposes certain conditions on the hamiltonian 
(which is part of J), and these conditions determine the energy levels of the 
atom. It would probably be difficult to solve the more complex problems 
of quantum theory without evaluating iff, but since the observables of 
physics are always space tensors and therefore derived from wave tensors 
of the second (or higher) rank, the wave vector factors must ultimately be 
recombined. 

In the present case (which is perhaps too elementary to be typical) the 
conditions for purity of J are expressed directly by the equations (5-45) 

*<M\\ n.nd t.Viprp is Tin npari t,n pvalnat.** t.ViA fnnf.nra 



5-5] The Simple Wave Equation 69 

5*5. Standard Forms of Pure Wave Tensors. 

Equation (5*492) asserts: 

A factor of J is an eigensymbol of every pentadic part of J, and the eigenvahte 
of a pentadic part is qs J. (5-51 ) 

Consider an antitriad E^, E ar , E^ p (2-36). Each pentad contains one and 
only one member of an antitriad. Hence, in the expression (E^ v + E^ + EX P ) m, 
each term is a pentadic part and has eigenvalues im. Accordingly the form 

J^(E tLV E <jr E^ + E^)m (5-52) 

will, if the signs are properly chosen, satisfy the condition (5-51) that the 
eigenvalue .of every pentadic part is equal to the quarterspur. It turns 
out that four of the combinations of sign make J factorisable, and four 
do not. For example, take. the + sign for the first two terms; then 



Hence, if /*, v, a, T, A, p is an even permutation of 0, 1, 2, 3, 4, 5, 



so that the + sign must also be taken for the third term in order to satisfy 
(5-492). It is then easy to verify that J is factorisable by working out the 
factors in a particular matrix representation, or by testing it for idempotency 
according to the theory given in the next section. 
Accordingly our result is that 

J = (E^ + E m + EX P + j0 16 ) m (5-54) 

is a pure matrix if //,, v, or, T, A, p is an even permutation. Any two of the first 
three terms can be given negative sign, since this is equivalent to reversing 
the order of their suffixes and the permutation remains even. 

Any non-degenerate pure wave tensor can be reduced to the standard 
form (5-54) by a relativity transformation J' = qJq'. We first make a trans- 
formation so that one of the components, say J 5 , becomes zero. Then, by 

(5'45), 



Since J is non-degenerate, J 16 ^0. Hence the vectors (j l9 j 2 , fa, jj and 
Ois * ,?25 J35> ^45) have the same non-zero length, and by (5-46) they are at 
right angles. We can therefore choose two of the axes in four dimensions to 
coincide with them; we then have 



l = J25 = Jie > 32 = h = 4 = 15 = 35 = 

Applying (5-492) with <x = 0, 5, 3, we now have 

#1J10 = ih**l>> E U>3tot = ijltfr (^31 hi + ^32 J 

(5-552) 

Thus is an eigensymbol of the two commuting symbols E 19 E 25 and there- 
fore of their product iE u . It is therefore an eigensymbol of ( jE7 31 j 31 -f E Z2 j 32 ) ; 



70 Wave-tensor Calculus [5-5 

and the eigenvalue must be zero because (E 3l j 3l + E 32 j 32 ) anticommutes 
with 27 34 . Hence the third equation of (5-552) breaks up into 



so that ./si =#32* JH* 

Hence j 31 (E 3l iE^) $ 0; or, multiplying by E 9l , J 31 ( - 1 iE^) = 0. Now 
cannot be an eigensymbol of Jff la , because it is an eigensymbol of E^ which 
anticommutes with E 12 . Hence 

Jn = 0, J 32 = 0. 

Similarly we find J 41 = 0, j" 42 = 0. 

The pentad <x= 1 then gives (E^ + E^j^if/^ij^. Hence, by the first 
equation of (5-552), ^ 12 = 0. All the components are now accounted for, and 
J reduces to (E l -f E% 5 + E M + E I6 )j^ , which is of the required standard form. 

To obtain a standard form for a degenerate pure wave tensor we proceed 
as follows. Let J = 0x* be a degenerate pure wave tensor (J 16 = 0), and let 
jvr be a component which does not vanish. Then E m J is a pure wave tensor, 
since it has factors E w and #* ; and it is non-degenerate since its quarterspur 
is E w (E^j^) = j v . Hence E W J can be reduced to the form (5-54) by a 
relativity transformation. We take therefore 



E v cannot be E^, E OT9 E^ or B 16 , since J would then be non-degenerate; 
but it can be any other U-symbol. Taking E w = E^ , we obtain 

J=(E va + E^iE VT + iE^)m. (5-56) 

This is the standard form for a degenerate pure wave tensor. 

5*6. Idempotency. 

A symbol J is said to be idempotent if J 2 = J. 

To normalise an JS?-number we multiply it by an algebraic factor so as to 
make the quarterspur J. If it is represented as a matrix (so that a spur 
exists) we normalise it by making the spur 1. It is, of course, impossible to 
normalise a degenerate ^/-number. 

We shall show that a necessary and sufficient condition for a non- 
degenerate matrix to be pure is that it shall be idempotent when normalised. 

(5-61) 

Let J = ^x* be a normalised matrix so that 



spur J= 

Then J 2 =^X*^* = ^- 1 -X* SS= ^ 

so that the condition is necessary. To prove that it is sufficient, let T be a 
matrix satisfying ^^ spur y =L ., 



5-6] The Simple Wave Equation 71 

Any matrix can be expressed as the sum of a number of vector products; 
therefore let .... (6-63) 



Here the suffixes a, b, c, ... distinguish different vectors, the row-and-column 
suffixes being omitted as usual. We write A^ for the scalar product Xa*^* 
so that the product *l* a Xa*'l l bXb* reduces to -^a&^aXb*- Then 

T i -2 f -(^-l)^Xi.* + ^*a* + ^^X-i* + --0. (5-64) 
Corresponding to the four suffixes of x this gives four linear equations satis- 
fied by the vectors $ a , & , */t c , .... Using any one of these equations to give 
the value of a in terms of the other ^'s, we can eliminate ^ ft in (5-63) and 
so reduce by one the number of vector products on the right-hand side of 
(5-63). Repeating the process, we reduce the number of vector products one 
by one. 
The procedure fails if the coefficients of ifj a vanish in all four equations, 



But we can then use (5-65) to eliminate x* in (5-63), and the number of 
vector products is again reduced by one. 

The reduction can be continued so long as there are any non-vanishing 
coefficients in (5*64). When all the coefficients vanish so that 

A a a ~ AM ^ ACC ~ = *> A a fr = **ba ^ = ", 

no further reduction is possible. We then have, by (5-63), 

spur T = spur $ a x a * + spur fa x & * + spur $ c x* + 



But spiir T= 1, so that there can be only one term on the right-hand side. 
That is to say, T is the product of two vectors. 

A pure matrix is necessarily singular. This follows from 3-7 (6), since the 
idempotent condition, 7 2 J = 0, gives eigenvalues and 1. A singular 
matrix is not necessarily pure. 

If the square of a matrix is 1, the question sometimes arises whether it 
has to fulfil any other condition in order that it may be a member of a com- 
plete orthogonal set. The most commonly occurring combinations whose 
squares are algebraic are triadic and pentadic expressions; these can serve 
as individual members of a new complete set. But it has been pointed out 
by D. E. Littlewoodf that we can also form combinations of antiperpen- 
dicular matrices whose squares are algebraic, and these cannot be members 
of a complete set. We find by direct multiplication that the square of 

(5-66) 



f Journ. Land. Math. Soc. 0, 41 (1934). 



72 Wave-tensor Calculus [5-6 

is 1. Of the eight possible combinations of sign in an antitetrad, four yield 
factorisable matrices, as we have seen; the other four give matrices whose 
squares are algebraic. 

The quarterspur of (5-66) is \i, whereas the quarterspur of E^ is or i. 
Since the quarterspur is invariant for the transformation F fJlf = qE tl q f 9 there 
can be no such transformation connecting (5-66) and E^. Therefore (5-66) 
cannot be a member of a complete set. We shall call an expression of the 
form (5-66), or reducible to it by a relativity transformation, a compact 
U-number. 

I have not as yet found any physical application for compact JS7-numbers; 
but perhaps others will be more successful. They surely must have an im- 
portance of some kind, possibly in the theory of radiation or even in the 
theory of the nucleus subjects which we do not seriously attempt to treat 
in this book. 

5-7. Spectral Sets. 

We suggested in 5-4 that the wave vector was investigated in physics, not 
for its own sake, but because the existence of factors imposes certain in- 
variant conditions on the stream vector and on the hamiltonian which forms 
part of it. We may now go a step further, and say that the condition which it 
is sought to impose is that of idempotency. Those familiar with the Group 
Theory of wave mechanics will recall the fundamental part played by idem- 
potent operators in selecting the "pure" states of a statistical ensemble. 
Consider the wave tensors 

J fl =-ii (tf 



We have found in (5-54) that these are pure. Since the quarterspur is 
Ji7 16 = J, they are normalised. Hence they are idempotent, as can be 
verified by direct multiplication. We can also verify that their products are 
zero. They accordingly satisfy 

A set of operators satisfying the conditions (5-72) is called a spectral set. 
Here the set consists of four operators only. The more familiar examples of 
spectral sets in physics includean infinite number of operators. For example, 
let J\ denote the operation of selecting light of wave length A from a source 
of light represented by $; thus the light of wave length A existing in the 
source is represented by J A ^. If we repeat the selective operation J\ on 
J A ^, it makes no difference; hence J^ = J\. The symbol ^e^ denotes the 
operation of selecting wave length A' out of light already selected as being 



5-8] The Simple Wave Equation 73 

of wave length A; the result is obviously zero. Further, selecting every wave 
length in turn and adding the results, we reproduce the original source of 
light; hence S A JA is equal to the "identical operator" 1. The selective 
operators of spectral analysis therefore fulfil the equations (5-72), which 
ensure that they are idempotent, non-overlapping and exhaustive. 

As G. Temple has pointed outf it is equations of the form (5-72) which 
directly embody the physical conception of a "pure constituent". The 
mathematically convenient criterion of purity, namely factorisability of 
the operator in matrix representation, should be regarded as derived from 
(5-72) rather than vice versa. 

This suggests a new approach to the theory of the representation of 
phenomena by -E-symbols. We can regard the matrices E^ as introduced 
by a spectral analysis of entities represented by algebraic numbers (in 
particular, probability distributions or densities) into four pure constituents 
given by (5-71). This point of view is developed in 13-0. 

5*8. The Complete Stream Vector of a Particle. 

Consider a particle in spherical space-time. A classical particle is described 
by two 4-vectors, namely a position vector and a velocity vector. In five- 
dimensional representation the position vector is the radius of space-time 
which passes through the particle; the velocity vector is at right angles to it 
and lies in the four-dimensional hypersphere. 

If we take axes such that the position vector is in the E & direction and the 
velocity vector is in the E direction, the two vectors reduce to single com- 
ponents j 5 and .7*4 . Let us treat them as components of a single wave tensor. 
There is, of course, no compulsion to combine them; there is no unique 
definition of the wave tensor of a particle, any more than in ordinary rela- 
tivity theory there is a unique definition of the tensor of a particle; and it 
would be legitimate to investigate a tensor representing position only or 
velocity only, if desired. But we shall try to find a tensor, called the complete 
stream vector, which comprises both. 

If the complete stream vector is pure, it must have two more com- 
ponents besides j" 4 andj'g. We may take it to be 

(^41 + ^50 + ^40 + ')> ( 5 * 81 ) 

which is of the standard form (5-56) with p, v, a,r = 5, 4, 1, 0. The additional 
terms define an axis in the three-dimensional space, which is in some way 
characteristic of the particle. This axis, which in (5-81) is taken to be in the 
E l direction, is called the spin axis. 

The question now arises whether in ascribing a complete stream vector J 
to the particle we should take (5-81) to be the actual vector J or the vector 

t "The Physical Principles of the Quantum Theory", Proc. Roy. Soc. A, 138, 479. 



74 Wave-tensor Calculus [5-8 

density J . iE 6 . This is answered by the Uncertainty Principle, which asserts 
that a particle cannot have exact position and exact velocity simultan- 
eously. Thus our combination of a position vector and a velocity vector will 
not apply to a discrete particle, but describes an element of its probability 
distribution. We must therefore take (5-81) to be a vector density, so that 

t JE 6 = (E^ + EM + iE M + iE^) a. (5-82) 

From this we obtain J = (E n + E 19 + JB7 46 + E ol ) a, (5- 83) 

which is of the standard form (5-54) for a non-degenerate pure tensor. 

The direction of the spin axis is shown by the term E ol , or equivalently by 
the term E 23 which gives the plane of the spin. The velocity vector, which by 
our choice of axes is in the time direction, is represented by the term E^, 
this being the matrix of the rotation which would displace the particle in 
the time direction. Instead of a position vector, we have an invariant E 19 . 
The "position" of the particle is therefore invariant for all relativity rota- 
tions; this is only possible if we represent the particle as an entity uniformly 
distributed throughout the hypersphere of space-time. This is in agreement 
with the uncertainty principle; for we have ascribed an exact velocity 
vector J57 46 to the particle, and therefore its position is entirely indeterminate. 

The attempt to assign a combination of position vector and velocity 
vector to a particle breaks down, as the uncertainty principle foretells. The 
position vector H7 60 in (5-82) defines, not the position of the particle, but the 
position of an element of its probability distribution selected for con- 
sideration. 

We define an elementary particle to be an entity whose characteristics are 
completely specified by a complete stream vector of the type (5-83), so that 
it can be represented by simple wave vectors 0, x*~ It has exact momentum 
but indeterminate position. This would perhaps more usually be called an 
elementary state of an elementary particle; and it is contemplated that a 
number of elementary states may be superposed forming a wave packet 
which has approximate position and momentum. It is to be remembered, 
however, that the properties of observational significance are relations to 
other elementary particles or combinations of particles, and not the primitive 
relations to a symbolic frame summed up in (5'83). We must not be in too 
great a hurry to identify our formulae with those employed in the practical 
applications of quantum theory. 

By (4-65) the three-dimensional vector density or strain vector corre- 
sponding to J is S^iJEto* For the special wave tensor (5-83), we find 

S=-J. (5-84) 



CHAPTER VI 
REALITY CONDITIONS 

6*1. Distinction between Space and Time. 

In relativity theory the interval between two point-events is defined by its 
square ds 2 . In Galilean coordinates 

ds* = dt* - dxf 



the velocity of light being taken to be unity as usual. By the use of anti- 
commuting symbols, the square root can be expressed in rational form; thus 
we may write 



On squaring, the product terms cancel owing to the anticommutation, and 
we have (rfg)2== _ dx *_ dx *_ dx *_ dx * = ds * 

if a; 4 = it. The algebraic square root ds is the eigenvalue of ds. Since (6- 1 1 ) 
is the space vector, or displacement, between the two points, our conclusion 
is that the interval is the eigenvalue of the displacement. 

It is of fundamental importance that, since a pentad contains three 
imaginary and two real matrices, we cannot cover more than three dimen- 
sions with matrices of the same real or imaginary character. If then we use 
the imaginary matrices E l9 E 2 , E% for the three similar space dimensions, we 
have to use a real matrix E for the fourth dimension of the physical con- 
tinuum of equivalent points. Thus the distinctive character of the fourth 
dimension (time) is already foresJiadowed in the constitution of the pentads. 

For real phenomena x 1 , x 2 , x 3 are real, and a: 4 ( = it) is imaginary. Since 
E 19 E 2 , E 3 are imaginary and E^ is real, the vector interval ds is a wholly 
imaginary matrix. 

A matrix which is wholly real or wholly imaginary will be called mono- 
thetic. Two matrices are homothetic if both are real or both imaginary, and 
antithetic if one is real and the other imaginary. The distinction between 
space and time is comprised in the statement that ds is a monothetic matrix. 

In Schrodinger's wave mechanics certain "Hermitic conditions" were 
imposed in order that the mathematical expressions should correspond to 
real physical phenomena. It seems to have been generally assumed that 
the reality conditions of Dirac's wave mechanics must be of the same 
Hermitic form, although several writers have pointed out the difficulties 
arising from this assumption. In my own developments, I abandoned 
Hermitic conditions at the outset; it seemed illogical to retain them in an 
analysis which recognises sixteen different square roots of 1. We shall 



76 Wave-tensor Calculus [6-1 

determine the reality conditions of the present theory independently. By 
relativity considerations it is possible to determine them uniquely. 

For a formal treatment, it is best to begin by considering the reality 
conditions for rotations. Corresponding to a matrix E^, we can consider 
two antithetic rotations, E^Q^, E^iu^, where 0^ and u^ are real. Only one 
of these will be admissible for physically real phenomena. Now "physical 
reality" is an invariant property; therefore our reality condition must be 
such that it is invariant for all relativity rotations of the frame of reference, 
provided that these rotations are themselves physically real and therefore 
satisfy the reality condition that is being considered. 

For trial suppose that a rotation q = e^ 6 ^ is physically real, if E^O^ is 
imaginary. Then q is in general complex. Consider another rotation 
q l = e \E v Q v ^ w h ose matrix E V 9 V is imaginary and therefore satisfies the pro- 
posed reality condition. If now we apply the physically real rotation q to the 
frame of reference, E v 6 v -^q(E v 6 v ) q'. IfE^ anticommutes with E v , q (E V 8 V ) q' 
is complex, so that q t no longer satisfies the reality condition. Thus the 
proposed reality condition is non-invariant, and must be rejected. 

Accordingly the reality condition for a rotation is that q must be real. 
Then the matrix E V Q V of the rotation q l is real, and remains real when it is 
transformed to qE v 6 v q' by the real rotation q. 

The essential point in the argument is that the matrix of a relativity 
rotation can be transformed by applying another relativity rotation; so that 
any proposed reality condition is employed twice over and its self-consist- 
ency is thereby put to a test. In other words we have to secure that the 
physically real rotations constitute a Group. 

Considering the most general rotation in four-dimensional space-timef 
g = exp I (E 23 2 s + E 3l 31 + E l2 e i2 + Euiu u + Euiuu + E^ (6-12) 
the condition that q is real requires that 23 , 31 , 12 , ^ 14 , % 24 , ^ 34 shall all be 
real. An E u rotation gives, as in (--15), 

Xi = x l cos (iu u ) - 4 sin (m 14 ), x = x l sin (iu u ) + # 4 cos (iu u ), 
or, if # 4 = it, 

Xi = x l cosh w u + 1 sinh u u 9 t f = x l sinh % 4 + 1 cosh w 14 , (6-13) 
so that the rotation is hyperbolic (Lorentz transformation). Thus the rela- 
tivity transformations in four dimensions consist of three circular rotations 
and three Lorentz transformations u in agreement with experience. 

As a by-product we see that the real quantity concerned in the Lorentz 
transformations is t = xji, so that a 4 is imaginary (ifx l9 x 2 , x 3 are real). We 
have thus a deductive proof of the result, already noticed, that (6-11) is a 

t We here exhibit the six elementary rotations collected together for reference. As 
explained hi 4*2, it would be necessary to restrict them to infinitesimal rotations if it were 
intended to apply them simultaneously. 



6-2] Reality Conditions 77 

monothetic matrix. It is a matter of convention that it is imaginary, not 
real. By taking x l9 x 2 , x 3 real, so that # 4 is imaginary, we conform to the 
convention of ordinary relativity theory which assigns real measure to 
time-like intervals. But we might equally have taken x l9 x 2 , x$ imaginary 
so that # 4 would be real; ds is then real for space-like intervals. Thus the 
compulsory reality conditions are: 

For rotations (in four dimensions): q is a real matrix, 
For intervals or displacements: ds is a monothetic matrix, 
Alternatively we may measure an interval by its vector density 



) 

V 

ix, j 



Then by (6-11) -ids^E^dx^E^dx^ + E^dx^ + E^dx^. (6-15) 

Since E 5 is real, ds is antithetic to ds; and with the usual convention ds is 
real and ds is imaginary. 

At this stage it is well to review the progress of our theory of space and 
time. We have shown that, starting with a basal wave vector ^r, it is possible 
to construct a continuum of "equivalent" points which forms a four- 
dimensional hypersphere in five dimensions. In Chapter iv we showed that 
this continuum has the local isotropic quality of ordinary space-time in 
that rotations in any of its coordinate planes are relativity rotations; but 
we did not there discriminate between circular and hyperbolic rotations. We 
have now confirmed the resemblance in greater detail by showing that the 
relativity transformations of this theoretical continuum consist of three 
ordinary rotations and three Lorentz transformations; or equivalently that 
one of the four dimensions is antithetic to the other three. 

Further, referred to this continuum, the basal vector $ has the trans- 
formation properties of Dirac's $, and in fact satisfies identically an equation 
identifiable with Dirac's wave equation. 

We must remind ourselves, however, that this is no more than the embryo 
of the actual macroscopic space-time of our experience. In the next section 
we shall find a very significant difference which shows the need for intro- 
ducing further developments in due course. 

6* 2 . Translations . 

Considering the neighbourhood of a particular point P on the hypersphere, 
we take as usual the coordinate x 5 to be along the radius at P, so that 
#1, #2 #3 #4 are rectangular coordinates in space-time. This coordinate 
system is necessarily local, for it is impossible to construct an extended 
system of rectangular coordinates in a curved space. We must therefore 
restrict ourselves to an infinitesimal region around P. 

The transformation q = e^ E ^ gives a rotation of the hypersphere in the 
plane x l x^ and therefore displaces P in the x l direction. Thus infinitesimal 



78 Wave-tensor Calculus [6-2 

translations of a point along the four axes in space-time correspond to 
relativity rotations with matrices U 16 , E 2b , JS? 35 , U 45 . 

The recognised relativity transformations in space-time, viz. four trans- 
lations, three rotations in space and three Lorentz transformations, make 
up the ten relativity rotations in five dimensions introduced in 4'3. The 
customary approximation which treats space-time as flat hides the fact that 
translation is a form of rotation, viz. rotation about the centre of curvature 
of space-time. 

The general translation in space-time is accordingly given by the trans- 
formation 9==exp KEuOu + E n O n + E n O u + EM. (6-21) 

If we apply the previous reality condition that q is real, the expression in 
real variables is 

q = exp (E 15 iu lb + J 25 m 25 + E^iu^ + E^), ( 6 ' 22 ) 

since E 15 , jE 25 , J57 35 are imaginary, and E^ is real. As in (6-13) the u's give 
hyperbolic rotations, and 45 gives a circular rotation. This means that the 
continuum is open in three (space) dimensions and closed in one (time) 
dimension the reverse of the conditions in actual space-time. 

The root of the trouble is that, when space-time is pictured in five dimen- 
sions, these dimensions are four space-like and one time-like; for the radius 
of curvature represented in the fifth dimension is space-like. But the matrices 
associated with these dimensions are three imaginary and two real. The real 
matrix J57 5 is incongruously associated with a space-like dimension. 

Some writers on relativity have mooted the possibility that the world 
might have negative curvature. (Negative curvature refers to the Gaussian 
curvature which is proportional to l/B 2 . If B 2 is negative, B is an imaginary 
length, or equivalently it is a t'me-like radius.) But the proposal has been 
treated from the point of view of formal mathematics, and can scarcely be 
entertained in physical theory .f 

We shall find the significance of this incongruity in the next section. But 
we shall first consider how it is to be reconciled with the invariance of reality 
conditions. To conform to the actual universe it will be necessary to admit 
that the exponent of q is real in (6-12) but imaginary in (6-21); that is to 
say, the general rotation is to be resolved into = Q l + 2 , where 

i = + ^8i + ^iiii + ^i4^4 + ^i4^+^^il 

2 = JS? 15 15 + -^25^25 + -^35 ^35 + ^45 *^45 > I 

with the reality condition that X is real and 2 is imaginary. 

What then becomes of the invariance of physical reality? The saving 
circumstance is that we have restricted ourselves to an infinitesimal region 

t The objection to unclosed space arises from quantum theory rather than from relativity 
theory. It will become obvious as our investigation proceeds. 



6-2] Reality Conditions 79 

round a point P\ and at the point P the direction denoted by E 6 (normal to 
space-time) is absolutely distinguished from the other directions (4-4). 
Thus the separation of into X and 2 is defined in an absolute way; and 
we can attach different reality conditions to the two parts without coming 
into conflict with relativity principles. 

Let us now apply a rotation 1 ' + 2 ' to a rotation 1 + 2 . Under the 
transformation g 1 = ei e i', x and 2 transform separately. By the above 
reality conditions q l is real, and therefore ft!?/, ?i0 2 ?i' remain real and 
imaginary respectively, and continue to satisfy the reality conditions. 
Under the transformation g 2 = e ie ', Q l and 2 are not kept separate. The 
real matrix l is transformed into a complex matrix g 2 1 ? 2 / ; but it is easily 
verified that the imaginary terms in it are of the form 2 so that they 
satisfy the reality conditions. Similarly the imaginary matrix 2 becomes 
a complex matrix ? 2 2 g 2 / ; but the real terms introduced are of the form 
X , and satisfy the reality condition for lB Thus the reality conditions 
are found to be self-consistent, and the rotations which satisfy them form 
a Group. 

We have confined ourselves to an infinitesimal (initial) region because it 
is only locally that we can pick out a unique direction, absolutely different 
from other directions, to distinguish as J? 5 . But is it sufficient to treat an 
infinitesimal region? A relativity rotation cannot be real at some points 
and unreal at other points of space-time; we ought therefore to show that 
the same rotation tested in two different localities fulfils the proposed 
reality conditions at both or neither. It turns out that this self-consistency 
is assured automatically. If we examine the reality condition at another 
point of space-time, we must first make sure that it is a real point; and the 
test of its reality is that it is equivalent to the real point first considered 
that it can be transformed into it by a relativity rotation which satisfies the 
reality condition. Thus we have to lay down the reality condition for rela- 
tivity transformations of a single initial point (given as real) before we can 
decide what values of the coordinates represent real points, i.e. points which 
have real equivalence to a point known to be real. Thus we have not to show 
that our adopted reality condition is self -consistent over a predetermined 
real domain; the domain over which it is self-consistent is ipso facto the 
domain of real points. 

We begin with one real point P the observer, in fact, for all reality is 
relative to him. We determine a group of infinitesimal transformations which 
we define as the physically real transformations. These transformations 
applied to P give all the neighbouring real points. By a process of continua- 
tion (using at each stage the local reality conditions) we reach the more 
distant real points. Any of these real points can now be taken as the initial 
real point; proceeding from it we shall by the same construction obtain the 



80 Wave-tensor Calculus [6-2 

same real domain. This is a consequence of the group property of the 
transformations used. 

We have shown that the reality conditions (6-23) satisfy relativity re- 
quirements; but we have still to explain why these are the conditions pre- 
vailing in nature, rather than the simple condition that Q l and 2 should 
both be real. 

6*3. Neutral Space -time. 

We shall find later that the entity represented by a simple wave tensor is an 
electric particle (proton or electron). Up to the present we have dealt with 
one particle only; we have not yet developed the apparatus of description 
for two or more particles. It is therefore rather premature to talk about 
space-time, which is a macroscopic conception presupposing vast numbers 
of particles. What we have investigated is a preliminary geometrical frame- 
work in which the characteristics of a single elementary particle are repre- 
sented vectorially . This framework is the genesis of macroscopic space-time; 
and it already contains two of the most essential features: (1) four dimen- 
sions, of which one is antithetic to the other three, and (2) a radius of 
curvature. 

For clearness we shall here anticipate some of the changes which will take 
place in converting this preliminary conception into the space-time of 
macroscopic experience. By introducing great numbers of particles the 
radius of curvature will be greatly increased, relatively to the linear scale 
characteristic of a single particle (commonly recognised as the wave length 
of its Schrodinger waves). The increase of the population will give scope for 
irregularity, and the hypersphere of space-time will be distorted by gravi- 
tational fields. But the difference that chiefly concerns us now is that there 
will be a balancing of positive and negative particles. A universe containing 
only one particle, and therefore only one sign of charge, is lop-sided com- 
pared with a universe containing equal numbers of positive and negative 
charges. It will readily be imagined that if matter consisted of electrons only, 
the enormous negative potential would so alter the world that we should 
require a different type of space-time to frame the phenomena. It is just 
such a world in miniature that our theory of a single charged particle 
imitates. 

The Riemannian space-time of Einstein's general relativity theory is 
derived from the extensional relations of neutral matter. The test bodies 
whose behaviour determines its characteristics scales, clocks, moving 
particles, light waves are electrically neutral. Moreover, all macroscopic 
matter in our experience is to a very high approximation neutral; if the 
proportion of electrons to protons differs from equality by one in a billion, 
the electric charge expressed in ordinary units is stupendous. 



6-3] Reality Conditions 81 

The test bodies are used to measure intervals, and hence determine the 
tensor g^ v characteristic of neutral space-time. Ideally their use is restricted 
to regions where there is no electromagnetic field. In practice this is not a 
very important restriction, because the strongest electromagnetic fields 
encountered in nature correspond to a very trifling lack of balance of dis- 
tribution of positive and negative charges. But in an electromagnetic field 
too strong to be neglected, it would be impossible to use the test bodies to 
determine g^ v , because there is no agreed definition as to how g^ v is related 
to the indications of the test-bodies when electromagnetic fields are present. 
Each investigator has defined it according to his own fancy. No experi- 
menter would undertake to make accurate measurements of length in an 
intense electromagnetic field; statements about lengths and distances 
within the field are inferences from observations made outside the field, and 
depend on the theoretical formulae employed in calculating the inference. 
Current scientific literature abounds in rival formulae (usually embodied in 
an "action-principle") for making such calculations, each corresponding to 
a different definition of g^ v in regions where there is no means of determining 
it by direct observation.! 

We must therefore consider even at this early stage the main difference 
between the space-time of a universe whose content is neutral and the 
space-time of a universe containing a particle or particles of one sign only. 
We shall call the space-time of a universe containing only positive particles 
positively saturated; if containing negative particles only, it is negatively 
saturated. 

In 3-9 we have shown that there exist two kinds of frame, right- and 
left-handed, which cannot be transformed into one another by a relativity 
rotation. If we take two vectors T = S ^ B^ , T' = S ^ F^ , where E^ , F^ are 
respectively right- and left-handed, these cannot be transformed into one 
another by a relativity rotation. Clearly a distinction of this kind is 
required to discriminate between positive and negative charged particles. 
We shall therefore provisionally identify T and T' with the complete stream 
vectors of positive and negative particles subject, of course, to confirma- 
tion by detailed examination of the Resulting properties. 

Take right- and left-handed frames related as in (3'92), so that 

^18> ^5> ^15 ^25> ^35> ^45 = ~"^16> ~^5 "~^15 ~^25> ""^35 ""^45- 

(6-31) 
For the second particle we have T' = S t^ F^ = X t^ E^ , where 

^18 *5 W* ^25' W> ^45 ^ "~"'l6> ~~^5> "~^L5 ~~*25 ~^35 "~^45 (6'32) 

the other ten components being the same for both particles. Hence, con- 
sidering the position vector, the coordinates of the two particlesj are 

t See 13-4. 

j More strictly coordinates of elements of their probability distributions ( 5*8). 



82 Wave-tensor Cakulus [6-3 

respectively (^ , 1 2 , (, , 4 , J 6 ) and (^ , J 2 , * 3 , 4 , - J 6 ). If we take as usual the E 5 
axis normal to the small region that we are considering, so that t , t 2 , t 3 , t^ 
are infinitesimal, the particles are in antipodal regions in five dimensions; 
but the ordinary four-dimensional point of view is that they are at the same 
point of space-time (^ , t 2 , tf 3 , J 4 ), f and that the centre of curvature of space- 
time is in opposite directions along the normal, according to which particle 
we are considering. 

There is nothing surprising in this. The two particles are contemplated as 
alternatives; for our analytical machinery is not yet capable of dealing with 
two particles at once. By Einstein's theory the curvature of space-time 
depends on its contents. So, if we start by considering an infinitesimal region, 
we do not know how the region will continue until we have decided on the 
contents of the region. It will curve away from the tangent plane more or 
less strongly according as the density is high or low. We now see further that 
it will curve to one side of the plane or the other side of the plane according 
as we insert an elementary positive or negative charge (or a probability there- 
of). This is not noticed in Einstein's theory, because that is a theory of macro- 
scopic matter, which (even if it is electrically charged) contains equal num- 
bers of positive and negative charges to an extremely high approximation. 

It will be seen that the idea first suggested by the five-dimensional picture 
that the positive and negative particles are at antipodal points on a fixed 
sphere is rather misleading. They are not simultaneously present; and the 
sphere is not fixed until we have decided which is present. 

Our result is that, considering an infinitesimal region of space-time, if it 
contains part of the probability distribution of a positively charged particle, 
the radius of curvature will be in one direction of the normal, say # 6 ; and if 
it contains part of the probability distribution of a negatively charged 
particle, the radius of curvature will be in the opposite direction # 5 ; or 
briefly, if the region is positively saturated the radius of curvature will be 
in the direction # 6 , and if negatively saturated it will be in the direction 
# 5 . Suppose now that it is neutral having equal probability of positive 
or negative charge. Two possibilities are open. The curvature may be zero, 
or the radius of curvature may be in the direction of the imaginary normal 
ix b . These are the only alternatives which have neither a positive nor a 
negative bias. We know from ordinary relativity theory that the first 
alternative is incorrect; neutral matter does involve a curvature of space- 
time. Therefore we must accept the second alternative; the radius of curva- 
ture is in the direction ix 6 . That is to say the radius of curvature changes its 
character from time-like to space-like when we pass from a positively or 
negatively saturated world to a neutral world. 

t We have chosen the relation (3-92) rather than (3*93) or (3*94) in order that our formulae 
may refer to particles at the same point of space-time. 



Reality Conditions 83 

This result will be confirmed in our subsequent investigations. But I 
think it is clear, even at this early stage, that it is forced upon us. 

We can now understand the origin of the curious reality conditions in 6-2. 
The whole trouble was that the radius of curvature was associated with a 
real matrix E$ indicating time-like character. But the radius of curvature is 
time-like in the positively or negatively saturated world to which the most 
elementary formulae relate. It is in the transition to a neutral world that 
the space-like radius of curvature becomes substituted for a time-like 
radius. In a positively saturated world the relativity transformations 
q = e* satisfy the simple condition that is a real matrix. The V^l never 
gets a footing in so simple a world. According to this theory a positively 
saturated world is open in its space dimensions and closed in its time 
dimension. Saturation is so remote from the conditions of our actual ex- 
perience that we certainly cannot bring forward any observational evidence 
to the contrary. We can well imagine that the stupendous electrical repul- 
sions would be sufficient to burst any closed space. 

Since the vector E 5 x & towards the centre of curvature has in neutral 
space-time a value antithetic to its value in electrically saturated space- 
time, it is antithetic to the four-dimensional position vector 

X = E^ + E 2 x% + EzXz + E^x^ . 

The rotation about a centre at J 5 # 6 , which produces a displacement dX, is 
given by dX/E 5 x^ . Thus the rotation 2 is antithetic to its value in electric- 
ally saturated space-time, and is therefore imaginary. This gives the reality 
conditions (6-23). The change from a time-like to a space-like radius of 
curvature from saturated to neutral space-time is the source of the V^l 
which is such an inescapable feature of quantum formulae. 

6*4. Kinematical and Electrical Matrices. 

For general developments it is more convenient to take right- and left- 
handed seta Ep, Fp connected by (3'93),f so that 

^>^i6= -JV-^i6 (ft =1,2, 3, 4, 5), (6-411) 

the other ten matrices being the same. Let 

N^N^^iE^iE^. (6-412) 

We shall call N^ a neutral set. As a further generalisation we define a macro- 
scopic set Hp by ^ *-*%, A^, (6-413) 

the other ten matrices being the same. Then A is a scale constant which may 
be real, imaginary or complex. Real values correspond to electrically 
saturated space-time, and imaginary values to neutral space-time. An 

t We shall see in 6-5, that (3-92) gives the association of vector densities and (3-03) the 
association of vectors. 



84 Wave-tensor Calculus [6-4 

ordinary field of positive or negative potential could therefore be repre- 
sented by a complex value of A, the real part being very small compared 
with the imaginary part; this kind of representation is only used when we 
are pursuing a unified field theory, involving non-Riemannian geometry. 
More usually macroscopic electrical fields are represented as perturbing 
influences superposed on Einstein's neutral space-time. 

We may express a complete space vector in terms of macroscopic com- 
ponents, namely T^^m^M^^m^E^XL.m^E^ (6-42) 

where S fc refers to the ten unchanged matrices and S c to the six changed 
matrices in (6-413). We shall call the unchanged matrices the kinematical 
matrices, and the matrices whose sign is changed between right- and left- 
handed frames the electrical matrices. The electrical matrices are accordingly 

E l9 E 29 EZ, EI, E &9 J57 16 (6-43) 

and the kinematical matrices are those associated with the ten rotations in 
five dimensions. We shall adhere to this nomenclature, irrespective of the 
physical interpretation.! 

The relativity rotations in five dimensions correspond to the ten kine- 
matical matrices. We may therefore distinguish them as kinematical 
rotations, the other six relativity transformations being electrical rotations. 
As shown in 4-3 the kinematical and electrical parts of (6-42) are trans- 
formed separately by the kinematical rotations; so that from the ordinary 
standpoint they are separate vectors arbitrarily combined into one analy- 
tical expression. In combining them we can introduce a scale constant A, 
which remains invariant in the transformations. For example, in macro- 
scopic spherical space-time we may meet with two distinct but somewhat 
analogous vectors, say a velocity and a spin. We think it probable that there 
is some significant combination of these into a complete space vector, which 
will lead to a more far-reaching theory of the phenomena. But until the 
details of that theory are worked out, the combination involves an undeter- 
mined scale ratio A. If we regard the components m^ of the complete 
vector as definite and equal to the components of the two separate vectors 
as ordinarily measured, A must be embodied in the frame of our ordinary 
measurements which then becomes a macroscopic frame M^ . 

We have found representations of a complete space vector in five dimen- 
sions ( 4-3) and in four dimensions ( 4-4). There is also a representation in 
six dimensions. The components t l9 t 2 , ... t l& form a 15-vector, or anti- 
symmetrical tensor of the second rank in six dimensions. From a purely 
algebraic standpoint this is the most fundamental representation; and the 

f The matrix associated with a particular characteristic depends on whether the 
characteristic is expressed as a vector, vector density or strain vector. Thus the "electrical 
matrix" E l has not always an electrical significance. 



6-5] Reality Conditions 85 

theory of U-symbols is primarily a study of the group of rotations in space 
of six dimensions. But the ordinary vectors of that space (6-vectors) 
cannot be expressed in terms of -E-symbols. The physical application of this 
group is specialised by the fact that space, as defined in physics, is an 
abstraction of the extensional relations of neutral matter, and therefore 
involves the superposition of a right- and a left-handed frame in the 6-space. 
This superposition involves the selection of a pentad (here taken to be /5 0/t ) 
which, together with E IB , is reversed in sign between right- and left-handed 
frames. One suffix (in this case 0) thereby acquires distinctive properties; 
and, once chosen, it remains associated with an invariant direction in the 
6-space. The remaining five dimensions become the 5-space of 4-3. 

To review the argument: A right-handed frame can be represented as 
having complete symmetry in a (5-space. Such a frame would be appropriate 
as a reference system if the universe consisted of particles of one sign 
only. Since actual physical systems consist of nearly equal numbers of 
positive and negative particles, our actual reference system is based on a 
frame which is a superposition of right- and left-handed frames, the one 
being, as it were, a reflection of the other. In the composite frame it is 
no longer true that all directions in six dimensions are equivalent; one 
direction must be chosen as the axis of the reflection, and thereby becomes 
distinguished from the others. We have associated the suffix with this 
direction. Thus, whilst the elementary right-handed frame exhibits six- 
dimensional relativity, the actual composite frame exhibits five-dimensional 
relativity. Accordingly the starting point of the theory of actual space- 
time is the five-dimensional representation of wave tensors treated in 4-3. 

The kinematical rotations have the same relations to the frames E , F , 
Np , Mp . In each case they rotate separately the two parts into which (6-42) 
is divided, and the value of A does not affect the result. But the electrical 
rotations have opposite effects on E^ and F^ , and therefore correspond to 
some kind of separation of electric charge, or polarisation. For this reason 
they give apparently non-relativistic transformations of neutral or macro- 
scopic space vectors. This does not mean that it is unprofitable to investigate 
them further; but it is only by dropping the usual representation in Rie- 
mannian space-time, and following a unified geometrical representation 
of gravitational and electromagnetic fields, such as those of Weyl and the 
author, that these electrical transformations become admissible. 

6-5. Summary of the Reality Conditions. 

Since physical quantities can be expressed as space vectors, vector densities 
(four-dimensional), or strain vectors (three-dimensional densities), we have 
to state the reality conditions separately for these three forms. The following 
conditions refer to vectors, etc., in neutral space-time. 



86 Wave-tensor Calculus [6-5 

(1) Space vectors. 

The condition is that the electrical part, involving the matrices 



is antithetic to the kinematical part; or T e is antithetic to T k . 

(2) Vector densities. 

Since the vector density X-iTE &9 the above condition becomes 

(ZE 6 ) e is antithetic to (3^ 5 ) fc , 
so that the part of X, involving the matrices 

^16> ^159 ^25> -36> ^45> -5> 

is antithetic to the rest of the expression. 

(3) Strain vectors. 

Since the strain vector or three-dimensional density 8 = iTE^ 9 the 
condition becomes (SE ^ is antithetic to (8E^) k . 

By (4-66) the part of 8 which corresponds to the electrical part of I 7 is that 

containing v v v v v v 

o /& 23 , /& 31 , ^ 12 , /& 4 , ^ 5 , /& 46 , 

i.e. the real matrices. Thus the part of 8 containing real matrices is anti- 
thetic to the part containing imaginary matrices. We have therefore the 
simple result that the coefficients s^ of a strain vector are homothetic. 

(4) Kinematical rotations. 

These are restricted to ten components and their reality conditions, given 
in full in (6-23), secure that the part containing 

-15> ^25 > -^35 > -^45 

is antithetic to the rest. Comparing with the above results, we see that the 
matrix of a rotation must be regarded as a vector density. 

These conditions are founded on the result obtained in 6-3, that for a 
position vector JS? 5 a? 5 is antithetic to Ex+ Z? 2 # 2 4-^3^3 + jE? 4 # 4 . By 5-8, a 
position vector is part of a complete vector density. We have therefore to 
choose right- and left-handed frames related in such a way that the reality 
condition (2) for a vector density agrees with this. This justifies the choice 
made at the beginning of 6-4 when we took the connection to be that given 
by (3-93). 

The foregoing are the natural reality conditions in neutral space. But it 
must be understood that, since the only test is the invariance of physical 
reality, the conditions become less stringent as we limit the variety of 
transformations contemplated. Thus if we have a 5-vector U and a 10-vector 
F, forming a complete space vector U + V which satisfies the foregoing 
reality conditions, the combination U -f iV will violate them. But so long as 



6-5] Reality Conditions 87 

we confine ourselves to the ten kinematical rotations which transform 11 
and V separately, there is no special reason for preferring the combination 
U + VtoU + iV. It may well happen that U + iV is the customary form of 
combination. All we can say is that this latitude exists, because the expres- 
sion is a mental association rather than a genuine combination. Combination 
implies some loss of independence of U and F; so that a more general 
transformation is contemplated in which the combination U -f V persists 
and the combination U + iV is broken up. 

The hamiltonian H = E 1 p l + E 2 p 2 + E 3 p 3 + E^ m adopted in Chapter v 
satisfies the reality conditions for a vector density, since m is the com- 
ponent E^im. Accordingly J = ^* is a vector density. If we require the 
wave equation for a vector we must identify (p { , p 2 , # 3 , jp 4 ) with tho adjoint 
vector (J1&9J2&9 ./as > ^45) > an d therefore write the hamiltonian as 

H = E l5 p l + E 25 p 2 + Etopz + # 45 p 4 - m. (6-51) 

This satisfies the conditions (1) for a space vector. We need not stop to 
decide which of these forms is the most advantageous, because we shall find 
later that the form chiefly required in practice is that which corresponds 
to a strain vector. 

6*6. Charge and Spin. 

We have found in (5-83) that the complete stream vector of an elementary 
particle is, if the axes are suitably chosen, of the form 

J a = (E l + E 23 + EU + E IB ) im. (6-61) 

We have set a. = im in anticipation of the identification found below. We 
first notice that, since E 2S , E^ are real and E l , E IB are imaginary, J a satisfies 
the reality conditions for a space vector in neutral space-time ( 6-5 (1)). 

It may seem surprising that the result obtained in 5-8 is already adapted 
to neutral space-time, which was not formally introduced until later. The 
reason is that in the course of the derivation we selected the combinations 
of suffixes in accordance with experience-, and since we have no experience of 
electrically saturated space-time it was implicitly excluded. For a pure 
wave tensor in electrically saturated space-time the appropriate form is 

(E l + E 24t + E 35 + E 1B ) im, 

which is monothetic. It is easily seen that the momentum vector is then in 
the space-like direction E 3 . 
The hamiltonian is given by the last two terms of (6*61): 

H a = Eftim - E^im = E 46 ip + m. (6-62) 

We thus verify that m is the proper mass. The terms JE? 45 and E 19 in (6*61) 
accordingly represent the energy and the proper energy, which, although 



88 Wave-tensor Calculus [6-6 

equal in magnitude (owing to our special choice of coordinates), are exhibited 
separately in the wave tensor. 

To obtain a particle of opposite sign to J a we must reverse the sign of the 
electrical matrices, obtaining 

J b = ( E + #23 + #45 EM) im. 

The corresponding hamiltonian is H b - E^ip Q - m. It will be seen that if the 
standard form of the hamiltonian is taken to be 



the identification of m with the proper mass must be qualified by the 
proviso that this refers to the magnitude, and that the sign of m is that of 
the charge. The identification was based on (524) which gives m 2 , not m. 

The terms E l and U 23 describe a spin, the one giving the axis of the spin 
and the other the plane of the spin. But they play somewhat different parts, 
because one is reversed when we change the sign of the charge and the other 
is not. Clearly J5 23 , which is not reversed, represents the mechanical spin; 
and TJj , which is reversed, represents the magnetic moment. 

To obtain a particle of opposite spin we reverse the sign of J5 23 . In order 
that the wave tensor may remain pure, we must also reverse the sign of 
another term (5-5). We cannot reverse 7? 16 , since that would reverse the 
charge; we cannot reverse jE 45 , since that would give negative energy p (} , 
representing a "minus-particle". Hence we must reverse E. This confirms 
our interpretation of E l as a magnetic moment, whose sign depends on the 
direction of spin. 

Assuming arbitrarily that J a represents a positive charge with a "posi- 
tive " direction of spin in the plane x 2 x$ , we have the following classification : 

J a = im (E -f- #23 + ^45 + EM) positive charge, positive spin, 
J b = im ( - EL + EM + #45 ^ie) negative charge, positive spin, 
J c = im ( - E! - 23 + EM + E 16 ) positive charge, negative spin, 

J d = im (#JL - U 23 -f- E^ - E IB ) negative charge, negative spin. 

(6-63) 

The three-dimensional densities or strain vectors are also of importance. 
They are given by S = iJE^ . We obtain 

S a = im (E l -f #23 + EM -h E^ positive charge, positive spin, 
S b = - im (E 1 ^ 23 ,# 45 + # 16 ) negative charge, positive spin, 
S c = - im ( - # x # 23 4- JS/45 -I- E 1Q ) positive charge, negative spin, 

S d = - im ( E l + JE? 2 3 #45 4- E ie ) negative charge, negative spin. 

(6-64) 



6- 6] Reality Conditions 89 

We notice that in the strain vector the mechanical spin is represented by E 
and the magnetic moment by E^ opposite to the representation in the 
space vector. 

Since 8 a + S b + S c + S d = 4m, (6-65) 

the four forms can be regarded as resulting from a spectral analysis (5-7) 
of algebraic numbers. The properties of matter which is neutral as regards 
charge and spin can be represented by simple algebraic wave functions of 
the type introduced by Schrodinger. To introduce charge and spin Schro- 
dinger 's algebraic wave tensors are analysed into components $ a , S b , 8 C , S d , 
which can then be assigned modified probabilities independently, so that 
they no longer balance. This is the simplest way of connecting the theory of 
protons and electrons with the ordinary relativistic mechanics of neutral 
matter, as will be shown in 13-6. 

An entity represented by i (J a + Jb + J c + J d) or b Y I (8 a + 8b + S o + S d) 
will be called a neutral particle. The term is not intended to have any con- 
nection with the neutron. A neutral particle is not a combination of four 
particles; it is a single particle which (so far as our information goes) has 
equal probability of being a proton or electron and equal probability of 
either direction of spin. We regard S a , 8 b , fl c , 8 d as four elementary state* 
of a particle. In general the probabilities of the four states will be different. 
As a particular case (which, however, is the commonest case in practice) the 
probability of one of the states, say 8 a , may be unity; the particle is then 
classed definitely as a proton of positive spin. 

From the ordinary standpoint a neutral particle is a mathematical fiction, 
having no counterpart in experimental physics. Usually, if an experimenter 
knows anything at all about a particle, he knows whether it is a proton or 
electron. Direction of spin is less easily recognised; and a combination 
S + AS C , which gives a particle neutral as regards spin, may often represent 
the experimental knowledge available. 

The complete stream vector (space vector) of a neutral particle consists 
of a single component #45*45, and its three-dimensional density (strain 
vector) consists of a quarterspur E lB s^. Thus we have a very simple way of 
passing from electrical to neutral particles, namely by taking the quarter- 
spur of the strain vector an operation which corresponds to contraction of 
the corresponding wave tensor. 

In more general coordinates the term / 23 in the strain vector is replaced 
by three components jB 23 , E VL9 E 1Z . We have seen that these represent the 
magnetic moment. The electric moment (if any) will be represented by 
terms JE7 14 , E 24 , J 34 , since by the usual electromagnetic equations the 
magnetic and electric moments form a 6-vector. No such terms occur if the 
particle is at rest in the coordinate system; this is what we should expect, 



90 Wave-tensor Calculus [6-6 

since an electric moment implies a doublet, and cannot be associated with a 
single particle at rest. Electric moment terms are introduced if we set the 
particle in motion by applying a Lorentz transformation with matrix JE? 24 
or JE7 34 to (6-63). The electric moment thus produced is real. Dirac's theory 
seems to differ from ours on this point, since he obtains an imaginary electric 
moment for the electron.! 

6-7. Minus -particles. 

The wave tensor J a must presumably be regarded as expressing the 
absence of a particle of the kind represented by J a , or else an entity which is 
observationally equivalent to such absence. We may accept the current 
view (due to Dirac) that positrons and negatrons are minus-particles of this 
kind equivalent to the absence of electrons and protons. Thus 



represents a negatron. The sign of the last term shows that the charge is 
negative. 

The sign of the third term should signify that the energy is negative, 
according to the somewhat confusing definition of energy in wave mechanics. 
But the energy of a negatron or positron, as ordinarily understood, is cer- 
tainly positive. It is necessary to clear up this discrepancy of definition. 

Energy, momentum and spin are familiar conceptions in classical 
mechanics, and a complete energy -momentum -spin vector can be defined 
for a macroscopic system. Let us suppose that, by adapting this definition, 
we can assign to an elementary particle a complete vector T representing 
its energy, momentum and spin "as ordinarily understood". In wave 
mechanics we associate with the elementary particle a wave tensor J a = ^#*, 
whose components have at least some analogy with the energy, momentum 
and spin of classical mechanics. It is tempting to assume that T J a . But 
J n has the idempotent property that, apart from a numerical factor 
depending on the choice of units, J a 2 = Ja* We have therefore just as good 
reason to make the identification T = J a 2 . 

The proton and negatron which have opposite stream vectors J a , J a9 
have the same vector / a 2 . The identification T 7 tt 2 will accordingly make 
the energy of negatrons (and positrons) positive, as it should be. 

We therefore regard the primary wave tensor J = ^x* of an elementary 
particle as a charge-current vector, and 7 2 as the true energy-momentum 
vector. We have been considering the vector J for simplicity, but strictly 
the relation is between the three-dimensional densities S, $ 2 ; the charge- 
current density is represented by S, and the energy-momentum density 
byfl 2 . 

t Quantum Mechanics, 2nd ed., p. 263. 



6-7] Reality Conditions 91 

Since /S a is an algebraic multiple of /S, the two vectors coalesce for an 
elementary particle. This has happened by design rather than by accident. 
The coalescence of S* with S is virtually the definition of an elementary 
particle; for elementary character has been taken to correspond to purity, 
and purity to idempotency. This helps us to understand why so much im- 
portance is attached in wave mechanics to resolution into pure or idem- 
potent constituents. It is for these constituents that we are able to replace 
a quadratic dynamical property by a linear property. For impure con- 
stituents these would be distinct wave tensors describing different pro- 
perties. Broadly speaking S 2 represents mechanical characteristics and 8 
electrical characteristics; they become unified only in an elementary particle 
whose momentum and current coincide. 

This agrees with general relativity, in which the mechanical properties 
are specified by an energy tensor of the second rank (a quadratic function 
of the individual velocities), and the electrical properties by a charge- 
current vector (a linear function of the individual velocities). 



CHAPTER VII 
STRAIN VECTORS AND PHASE SPACE 

7- 1 . Internal Wave Functions . 

In classical mechanics it is usual to resolve the motion of a system of par- 
ticles into a motion of the centre of mass, and a motion of the individual 
particles relative to the centre of mass. We shall distinguish these as the 
external and the internal motions of the system. 

Similarly in wave mechanics we resolve the motion of a system into an 
external motion specified by external wave functions, and an internal motion 
specified by internal wave functions. So far as the external motion is con- 
cerned, the system is equivalent to a single particle located at the centre of 
mass. It is characterised dynamically by an external momentum vector 
(lh 9 Pz > #3 > jPo) an( i a proper mass m. If the space-time frame of reference for 
the external motion is changed, the momentum vector undergoes rotations 
and Lorentz transformations. The theory of the external wave function 
coincides with that of a simple particle. 

The internal wave function introduces new ideas. Lorentz transformations 
are not applicable to the internal motion; for, by definition, the internal 
motion is relative to the centre of mass, and, if we applied a Lorentz trans- 
formation to it, it would be referred to some other standard of rest. The 
time-axis of the frame of reference for internal motions and wave functions 
must agree with the direction of the external momentum vector. 

Thus we have a uniquely defined space-time frame for internal motions, 
and "simultaneity" has a definite meaning provided that the system is 
not so extensive as to make it necessary to take account of curvature of 
space-time. It is part of our mental conception of a complex system that 
it is a simultaneous aspect of its several parts. Each particle has three 
internal (relative) coordinates ^ = 2^ x^ and the momenta conjugate to 
f^ are the internal momenta of the particles of the system. There is one 
time-coordinate s common to the whole system. If the same coordinate 
system is used for the internal and external motions, the external time will 
also be s. But in general different frames will be used, since it would be idle 
to consider external motion if the external frame had always to be chosen 
so that the system was at rest in it; the external time t will then differ from 
the proper time s. 

The independent variable for the internal motions and wave functions is 
always the proper time s. It should be noticed that if, for special purposes, 
separate internal time-coordinates analogous to the internal space co- 
ordinates are assigned to the particles, these will be T=$ t. To associate t 



?!] Strain Vectors and Phase Space 93 

with Xp - Xp (as is sometimes attempted) is a hybrid procedure unwarranted 
by any theoretical principle. 

For internal wave functions Lorentz transformations are definitely ruled 
out; but relativity rotations in three-dimensional space are applicable. The 
direction of the time-axis is prescribed by the external momentum vector; 
but there is no corresponding specification of the orientation of the other 
axes. 

The use of internal wave functions which, by their very nature, cannot be 
subjected to Lorentz transformations is often called "non-relativistic 
treatment" with the implication that it conflicts with the principle of 
relativity. This is a misunderstanding of the nature of the Lorentz trans- 
formation and its place in relativity theory. So long as we deal with quan- 
tities defined as relations between physical entities relative coordinates, 
relative velocities, relative momenta we are on safe ground. (By relative 
coordinates we do not, of course, mean the difference of coordinates in some 
arbitrary frame of reference,! but the coordinates of one particle in the frame 
in which the other particle is at rest at the origin.) It is when we introduce 
into our formulae analogous quantities relative, not to physical reference 
objects, but to abstract frames of space and time, that Lorentz invariance 
is demanded. For we are then unable to define which of the equivalent 
frames has been selected to which frame our formulae apply so that, if 
the formulae are to mean anything at all, they must have a form invariant 
for all transformations of the frames. 

In relativity theory itself there has been no such tendency to let Lorentz 
invariance grow into an obsession. One of the best known formulae in 
relativity theory is ds 2 = -y- 1 dr 2 -r 2 d0 2 -r 2 sin 2 0d^ 2 H-ydi 2 for the line 
element in the gravitational field of a particle. This is not invariant for 
Lorentz transformations; but we can scarcely describe the formula which 
is the soiirce of the three crucial tests of Einstein's theory as "non-rela- 
tivistic". If Lorentz invariance is not demanded in the investigation of 
the motion of a planet round the sun, it can scarcely be demanded in the 
investigation of the motion of an electron round a nucleus. 

Hitherto, in developing the analytical theory, we have had in mind the 
motion of a particle or the external motion of a system. In the present 
chapter we shall introduce the modifications appropriate for treating the 
internal configuration and motion of a system. There are two fundamental 
differences. Firstly, the time reckoning will now be a proper time, fixed by 
the external momentum vector, and therefore invariant for any permissible 
transformations of the variables describing the internal configuration. 
Secondly, whereas change of the coordinates x^ of the centre of mass is a 

t This possible confusion of meaning does not arise in the case of relative momentum, 
which obviously has no connection with difference of momentum. 



94 Wave-tensor Calculus [7-1 

relativistic change, all points in the space-time hypersphere being equi- 
valent, change of the relative coordinates f^ is in general a strain or intrinsic 
distortion of the system, and is therefore not to be represented as a relativity 
rotation of the vectors describing the internal state. 

7- 2 . Co variant Wave Tensors . 

We shall now consider the transformations of a covariant wave tensor of 
the second rank 8^. Like the mixed tensor it is a matrix composed of 16 
elements; and it may be resolved into matrix components in an orthogonal 
frame by the same formula 8-I^s^E^ but s^ will not be a space vector. 
We shall call s^ a strain vector. Thus a strain vector corresponds to a co- 
variant wave tensor in the same way that a complete space vector corre- 
sponds to a mixed wave tensor.f 

The transformation formula for a covariant wave tensor has been found 
in (1-53), namely S' = qSq. (7-21) 

Let q = e^ E f e . First let E^ be an antisymmetricai matrix, so that 



Then (**" V = cos W . ( 1 ) ]3a + sin *0 . 



That is to say, 9^9"^ 

If EH is a symmetrical matrix, we have q = q. Hence (7-21) becomes 
S' = qSq~ l for antisymmetricai transformation matrices! 
= qSq for symmetrical transformation matrices. } 

The transformation qSq~* agrees with that of a mixed wave tensor (1-463). 
The transformation qSq has been called an antiperpendicular rotation 
(4-16). We have therefore the result: 

A strain vector behaves as a space vector when the transformation matrix is 
antisymmetricai, but undergoes antiperpendicular rotations instead of the 
corresponding ordinary rotations when the transformation matrix is sym- 
metrical. 

As in the case of space vectors, we shall use the name "strain vector" 
indifferently for the array of components s^ or for their symbolic combina- 
tion S^XSpEp. 

A strain vector may be represented graphically by plotting its com- 
ponents s in a 16-dimensional space. But it must be remembered that the 
line which represents it will be thought of as a space vector. When a tensor 

t This is the fundamental definition of a strain vector. It will be shown in 7-6 that 
three-dimensional vector densities are strain vectors, and they have therefore been called 
strain vectors in anticipation ( 4-6). 



7-2] Strain Vectors and Phase Space 95 

transformation is applied, all geometrical reference lines in the space rotate 
as space vectors; the strain vector follows its own transformation law, and 
is displaced relative to this background of space vectors. We shall now 
examine this relative displacement. 

Let S be a strain vector and T a space vector, and at first let T coincide 
with 8. Perform the infinitesimal transformation g = e* de , where d is a 
general infinitesimal matrix. We write d*=d 8 +d a , where d 8 , d a are 
its symmetrical and antisymmetrical parts. Since d is infinitesimal, q is 
equivalent to the transformations </ a = e* d and g a = e^* applied succes- 
sively. Thus we have 

S'-Matffc- 1 *., T f ^q s q a Tq a ^q 8 -\ (7-23) 

And, since 8 = T, S' = T'q* = T'e d *. (7-24) 

Hence, d 8 being infinitesimal, 

S'-T' = T'd 8 = Sd 8 . (7-25) 

We take S to be non-singular. Then 8d 8 ^0 for any non-zero value of 
d 3 . Since the general symmetrical matrix contains ten independent con- 
stants, the relative displacements Sd 8 will occupy a ten-dimensional space. 
We call this the phase space of the strain vector. 

To exhibit this graphically, let S and T be represented by the lines OQ 9 
OP in the 16-space. A tensor transformation of the space vector is repre- 
sented by keeping OP fixed and rotating the axes of reference, thereby 
altering the components ^ referred to the axes; thus P can be regarded as 
a fixed origin. At first Q coincides with P; but the transformation produces 
the relative displacement PQ = 8' T'. The interesting point is that, 
although there are 16 independent tensor transformations, Q is limited to 
a ten-dimensional locus. Provided that 8 is non-singular, every direction 
in this locus is a possible direction of PQ. 

This construction gives only an infinitesimal region of phase space, and 
we must extend it by a process of continuation. The problems which arise 
in constructing the complete phase space will be considered later. 

We have seen that antiperpendicular rotations represent intrinsic 
deformations of the physical system considered. The points of phase space 
therefore represent in systematic order different intrinsic states or con- 
figurations! of a system described by a strain vector. This corresponds to 
the usual definition of a phase space in statistical mechanics. In the applica- 
tions for which it is intended, phase space is occupied by a probability 
distribution. 

t I shall generally use the term configuration; it is to be understood in its broadest sense. 
State would express the meaning better; but I was anxious to avoid using a term which has 
been given a technical significance by Dirac. The term phase is often used in statistical 
theory; but I have reserved it for the angular variables occurring in the exponentials. 



96 Wave-tensor Calculus [7-3 

7*3. Real Phase Space. 

For treating strain vectors and phase space we adopt a frame E^ consisting 
of four-point matrices. The E^ then consist often symmetrical and six anti- 
symmetrical matrices; the symmetrical matrices are imaginary and the 
antisymmetrical matrices real. In 6-1 space-like directions have been 
associated with imaginary matrices and time-like directions with real 
matrices; it is convenient to use these descriptions generally as a nomen- 
clature rather than as an anticipation of the physical manifestations of the 
vector components associated with them. Having regard to a future 
extension to double wave tensors, the definition is best stated in the form: 

A space-like matrix is homothetic with its eigenvalue; a time-like matrix is 
antithetic to its eigenvalue. 

The eigenvalues of the E^ are imaginary. Thus, for simple wave tensors, 
we have the equivalence: 

Space-like = symmetrical = imaginary matrices, 
Time-like = antisymmetrical = real matrices. 

The ten dimensions of phase space are space-like. 

We can no longer represent the E^ by general fourfold matrices, since 
these are usually neither symmetrical nor antisymmetrical. At first sight 
the limitation to a particular frame of matrices seems a serious loss of 
4 * relativity". But the application of this chapter is to the internal wave 
tensors of a system, which, as we have seen, are not subject to Lorentz 
transformations. It is not merely permissible to use a fixed frame; it is 
essential that the frame to which these internal tensors are referred should 
have some quality which resists Lorentz transformations. J 

In treating external space we considered it to be a drawback that in 
matrix representation the E^ have properties additional to those which 
they are defined to have as constituents of a complete orthogonal set. But 
the additional properties come in useful in internal space, because we have 
to indicate a distinctive direction, viz. that of the external momentum 
vector, and the planes of simultaneity orthogonal to it. We do this by giving 
to the matrix J5 4 , belonging to the distinctive direction, the property of 
antisymmetry; matrices belonging to directions in the plane of simultaneity 
are symmetrical; matrices belonging to intermediate directions are neither 
symmetrical nor antisymmetrical. 

We have seen that one of the difficulties of applying to physics an algebra 
comprising complex numbers is that half the mathematical possibilities 
have to be set aside as unreal, i.e. not corresponding to actual phenomena. 
We approach the problem of determining the reality conditions of phase 

f It is the Lorentz transformations of the frame which introduce asymmetry. For spatial 
rotations the matrices preserve their symmetry or antisymmetry. 



7-3] Strain Vectors and Phase Space 97 

space somewhat differently from the similar problem in Chapter vi. The 
feature of phase space is that it is the seat of a probability distribution. If the 
probability distribution does not extend to the unreal configurations if they 
have zero probability there is no need to stigmatise them further. Accord- 
ingly the discrimination of the world of real phenomena from the world of 
unreal phenomena is made when we insert the probability distribution. We 
shall show that this fixes the reality conditions of phase space unambiguously . 

The method of statistical mechanics contemplates an initial or a priwi 
probability distributed over phase space so that the probability in any 
volume is proportional to the volume. Observational information modifies 
this initial probability, so that the actual pro bability concerned in a particular 
problem is the product of the initial probability and a modifying factor. It 
is essential to the method that the phase space over which the probability 
is distributed should have finite volume; for if it were infinite, the initial 
probability associated with any finite region would be zero, and the method 
would break down. This requires that phase space should be a closed space. 

It is true that statistical mechanics is often applied to a space closed by 
a supernatural barrier instead of by its own re-entrancy . But the barrier is 
merely a compression of the curvature required to close the space into a 
singularity. If we reverse the motion of the particles by natural fields of 
force instead of supernaturally, space must be curved to represent these 
fields. No ordinary field is such that an electron has zero probability of 
leaking through; to confine it rigorously the curvature must be sufficient to 
close the space. 

Thus, in order that our phase space may be the seat of a probability dis- 
tribution, it must be closed. This means that the matrix dQ^^E^dQ^, 
which gives the displacements in phase space, must correspond to circular, 
not hyperbolic rotations. This requires that the dO^ shall be real (cf. (4-15) 
and (6-13)). Then, since the symmetrical E^ are imaginary, d& 8 is imaginary. 
We have the result: 

In order that the displacement in phase space given by the transformation 
q e \d s ma y represent a real change of configuration, d 8 must be an 
imaginary matrix. (7'31) 

This reality condition may also be expressed by saying that the space is 
a phase space in the other sense of the term phase. Instead of a single algebraic 
phase 8 indicated by a factor e ie , we have ten phases for which i is replaced 
by different matrix roots of - 1. Since the 0^ are real, they are real periodic 
phase angles in the ordinary sense. 

We can see this most easily by taking the initial value of the strain vector 
to be unity. This involves no loss of generality. By (7-24) 8' = T'e d **. Since 
S' and T' are non-singular, T' has a reciprocal T'" 1 ; then 



98 Wave-tensor Calculus [7-3 

The transformation law of T-*S is 



so that T~ 1 S is a strain vector, and its initial value is 1. Thus the phase space 
generated by the transformations of an arbitrary non-singular strain vector 
8 may equally be regarded as generated by the transformations of a unit 
strain vector, viz. (1)' = 1 e d& *. (7'32) 

If d& 8 does not involve non-commuting components, (7-32) can be in- 
tegrated so as to apply to finite displacements. Thus there will be a line of 
configurations for which the strain vector is 



If is real, the strain vector repeats itself at intervals 0^ = 2?r, so that we 
return to the original configuration. In other words phase space is re-entrant 
in the 0^ direction. If 0^ is imaginary, the strain vector is non-periodic and 
the phase space is open in that direction. Hence, in order that phase space 
may be closed, real configurations must correspond to real phase angles 0^ . 

Following an arbitrary track in phase space the transformations are in 
general non-commutative. Thus the exponentials combine by non-commuta- 
tive multiplication, and the increments of the phases combine by non- 
commutative addition. The ten-fold phase is therefore non-integrable; this 
means that it must be represented in a curved space. 

By 3-6 the transformations g = e* A V^^, in which dO^ is real, are all 
unitary. Thus a strain vector which is non-singular initially remains non- 
singular throughout phase space. 

By (7-31) and 6-5 (3) the matrix d& s , which represents displacement in 
phase space, satisfies the reality conditions for a strain vector, but not those 
for a space vector or vector density. We must therefore take d 8 to be a 
strain vector. 

7-4. Coordinates in Phase Space. 

Let g = e id be the transformation which displaces a point Q to a neigh- 
bouring point Q' in phase space, and let 

d&^^E^. (7-41) 

Then the ten 0^ provide locally a coordinate system for describing points Q' 
near an origin Q. We call these local orthogonal coordinates or natural 
coordinates. A corresponding system of linear coordinates x^BO^ is 
introduced by attributing to the phase space a definite scale constant jR. 
For the present B must be considered arbitrary, since it could only be 
defined by introducing relations to an extraneous system. 
The volume of a ten-dimensional element of phase space is defined to be 

do>=d0 1 d0 2 ...d0 10 . (7-421) 



7-4] Strain Vectors and Phase Space 99 

It is a scalar quantity. We can, if we prefer, write it as E l dd l .E 2 dd 2 ... E w dd w , 
but the product of the ten matrices is found to be 1. We thus have a 
definition of equal volumes at different points of phase space, each volume 
being measured in terms of natural angular coordinates at the point where 
it lies. By this definition equivalent volumes are equal volumes. As in 2*9 
equivalent volumes are formed by making the same construction in 
equivalent frames. 

For any other system of coordinates x^ , we have in the notation of general 

relativit y d^V^g.dr, (7-422) 

where dr = dx^dx^ . . . dx lQ . 

In particular for natural linear coordinates 

V^7==jR- 10 . (7-43) 

The local orthogonal system is only applicable when the squares of 0^ are 
neglected. The most important equations of physics are differential equations 
of the second order, and in order to investigate them it is necessary to intro- 
duce a coordinate system valid at least as far as the squares of the co- 
ordinates. This problem will be treated in the next section. We have not 
much occasion to employ the properties of phase space as a whole, and our 
methods are chiefly adapted for treating an infinitesimal region. But it is 
important for our theory to prove that the whole volume of phase space is 
finite. Although this seems rather obvious, I have had some difficulty in 
proving it formally. We have adopted circular rotations in order to secure 
finiteness; but until we examine how they are to be extended beyond an 
infinitesimal region we cannot be sure that they will achieve this end. They 
secure re-entrancy along geodesic tracks, but we have still to prove that 
phase space has no tortuous exit. Although phase space has the same kind of 
uniformity as a hypersphere, it is different from a hypersphere; it contains 
pairs of antiperpendicular directions. Thus ordinary spherical coordinates 
do not apply, and I do not know of any suitable adaptation of them. 

There is a fundamental difficulty in specifying finite deformations or 
strains of a system, which arises in the following way. Let A l9 ^4 2 denote 
two different orientations of the same (unstrained) system, and let A 1 
denote the system in a strained condition; how are we to decide whether the 
strain should be measured by A' A l or A' A 2 1 There is no absolute 
one-to-one correspondence of the orientations of strained and unstrained 
systems; but it is necessary to lay down some conventional rule which will 
prevent our representing the same deformation twice over as A' A l and 
A'-A 2 . 

We have secured a unique representation of infinitesimal strains. The 
initial (unstrained) state was represented by S or T\ the strained state 8' 



100 Wave-tensor Calculus [7-4 

was then compared with T", which represents the unstrained state in a 
different orientation connected with S' by tensor rules. This cannot be 
extended to finite regions, because the transformations T-+T' are not 
integrable. To meet this difficulty, consider the transformation 

3 =n^ = IIe*^M, (7-44) 

the product consisting of ten factors with space-like E^ arranged in a fixed 
order. The elementary transformations q^ are applied successively (in the 
reverse order to that in which they are written). The ^ are not infinitesimal. 
Applying this transformation to a strain vector, so that $->$', we obtain a 
strained configuration which will correspond to some point Q f in phase space. 
We can adopt ^ as the coordinates of Q'. 

Let one of the ^, say <f> a , receive an increment d<f> a . Let the new point 
be #", and the new value of q be q". For two matrices q, q" which differ 
infinitesimally we can find a matrix d such that 

q" = e***q. 
It is easily seen that d = XE d<j> X-*, (7-45) 

where X is the part of the product IT which precedes q a . X cannot be singular 
(3-6). In general d will include time-like matrices. Since e* rfw is the 
transformation which changes Q' to Q", the components of d s are the 
natural coordinates dO^ of Q" referred to the origin Q'. 

Similarly, we can express the other displacements d^ in natural co- 
ordinates dOp at Q'. The jacobian 3 (^J/3 (0^) gives the ratio of the volume 
element dr = dfadfa . . . d<f> w to the natural volume element da) = d9 l d6 2 . . . d0 10 
and hence determines V g in the usual formula da) = V gdr. 

The half-period of each of the ^ is 2?r. After each half-period the values 
of q repeat themselves with opposite sign, and the corresponding strain 
vectors repeat themselves; so that the half-period represents a circuit of 
phase space. Hence for the whole of phase space, or of that part of phase 
space covered by the coordinate system ^ , 



Thus unless V g becomes infinite anywhere, or unless there are configura- 
tions not obtainable by the transformation (7-44), the volume Jdeo is finite. 
Since X cannot be singular, (745) shows that 30^/3^, is not infinite, and 
hence V~-^g is never infinite. There are, however, loci where V-gr = 0. For 
example, if the first two factors of II are e*^"e*^, where E v , E a anti- 
commute, we have by (7-45) for a displacement d<f> a 

d = e^^E a d^ a e^^ e E ***E a d<t> . 

On the locus ^ p = i^, this becomes E v E a d<f> a . The product of two anti- 
commuting space-like matrices is a time-like matrix; thus d is wholly 



7-5] Strain Vectors and Phase Space 101 

time-like, and d 8 vanishes.f But this type of singularity, in which a small 
natural volume is infinitely magnified in coordinate volume, is harmless for 
our purpose. 

I have little doubt that a geometer could furnish a more elegant proof. 
Probably he could evaluate the volume. But 1 think the above investigation 
satisfies us that the volume of phase space is finite. 

7*5. Stereographic Coordinates. 

One of the most important practical steps in the theory is to provide the 
analytical machinery for investigating differential equations of the second 
order. For this purpose we introduce a system of coordinates valid (under 
certain restrictions) for finite regions of phase space. 

On a hypersphere of radius R, Stereographic coordinates are such that the 
line element is 



_ )? (7 . 51) 

where r a = # 2 + y z + z 2 + . . . . The coordinates are thus locally orthogonal and 
isotropic, but not uniform; the actual length ids is A r times the Euclidean 
length (dx 2 + dy 2 +dz 2 +...)*, where 

A^l + r 2 /^ 2 )- 1 . (7-52) 

The hypersphere is in this way projected into a Euclidean space with a 
variable gauge factor A r . 

Analogous coordinates x^ in phase space are defined as follows. Let 
X = S 8 ^o; /t , the summation being restricted to the space-like matrices. 
Then Stereographic coordinates are such that the displacement from the 
origin to the point x^ corresponds to the transformation 

The right-hand side is to be interpreted by expanding in infinite series. This 
formula is limited to a domain containing only perpendicular coordinates 
together with its infinitesimal neighbourhood in all directions. That is to 
say, X is limited to a pentadic expression or to a single algebraic variable, 
but dX is unrestricted. The inclusion of the infinitesimal neighbourhood is 
essential, because these coordinates are used principally when we are treating 
the complete ten-dimensional volume element of phase space. 
Accordingly X* is algebraic, J and we set 

Z 2 = -r 2 . (7-54) 

t This is illustrated graphically by taking E ff , E to correspond to rotations of a sphere 
in the planes xy, yz. Taking an origin on the x axis, a displacement from the origin to any 
point on the sphere can be represented by two such rotations in the order given. Different 
values of <f> a , <f> v will give different points, unless </> v = k 7r or <t><j~^- This example shows 
that the singularity of the representation at these two points does not signify a failure of 
the coordinate system ^ M to cover the whole of the space. 

% It is tempting to describe the domain of X as being limited by the condition that X 2 is 
algebraic; but we define it more stringently in order to exclude compact ^-numbers (5*66). 



102 Wave-tensor Calculus [7-5 

If displacement from x^ to x^ + dx^ corresponds to the transformation 
e** e , we have by (7-53) 

<* JHJW*_ (l(1Xm\ l . (7-55) 

C tl-Z/2JZJ -\l-(X+dX)l2R\ l j 

When -2C is limited as above, the most general matrix dX can be divided into 
two parts, one of which commutes and the other anticommutes with X\ for 
the limitation secures that a complete set can be formed with X/r as one of 
its members; and if dX is resolved in that frame, its components either 
commute or anticommute with X/r. The transformations corresponding to 
the two parts of dX can be treated successively, since they are infinitesimal. 
First, let dX commute with X. Then, since there are no non-commutative 
symbols, (7-55) can be solved like an algebraic equation, giving 



d@ 



| log r 



Next, let dX anticommute with X. Then the differential of any even 
power of X is zero. Since 



dX/2R _ 



Hence by (7-55) 



By definition Jf and dX contain only space-like matrices, and since they 
anticommute their product is time-like. Thus (7-56) and (7-57) give the 
same value of d 8 d ^ (l + r *i B *)-i dX IR = \ r dXIR (7-58) 

by (7-52). 

Thus the differentials dx^ are the natural linear coordinates at the point 
considered, but the scale constant B/\ r is variable precisely as in (7-51). 

The volume of a ten-dimensional element being 

dw = dO^dO^ . . . d9 lQ = V gdx^dx^ . . . dx lo , 
we have by (7-58) _ 

V - g = (A,/!?) 10 = J2- 10 ( 1 + r 2 /.R 2 )- 10 . (7-59) 

The following theorem is required later: 

If a transformation X (not infinitesimal, but not containing antiper- 
pendicular components) is applied to the strain vectors, the stereographic 
coordinates of all points in the infinitesimal neighbourhood of the origin are 
changed by the same amount, to the first order. 



7-6] Strain Vectors and Phase Space 103 

Let the stereographic coordinates 0, dx^ of the origin and a neighbouring 
point be changed by the transformation to x^ , x^ + dx^ ; and let dX = S E^ dx^ , 
etc. We have to show that dX'=dX. The transformation is that given by 
(7-53), and we have 

ilX!2R^ _ 
U-Z/2IZJ ~ 

where d3T" differs (if at all) from dX' by including time-like components,! 
By the conditions imposed on X , we can divide dX into two parts which 
respectively commute and anticommute with X, and treat them separately. 
For the commuting part the left-hand side can be treated algebraically, and 
gives immediately the required result dX" = dX. For the anticommuting 
part we proceed as in obtaining (7-57) and find that dX" differs from dX by 
time-like components only, so that dX' = dX. 

This result may also be stated in the form: In stereographic co- 
ordinates, a finite displacement of the above restricted type commutes 
with all infinitesimal displacements. 

The results of this section are used extensively in Chapter xn. 

7-6. Associated Strain Vectors and Space Vectors. 

We employ four-point matrices, jB 4 , E 6 being as usual the real members of 
a pentad. Let e^ be a covariant wave tensor which has the value JSL in the 
coordinate system initially chosen. Then, after a transformation g, we have 
fysssgJE^gr 1 or qE^q according as q contains a time-like or a space-like 
matrix (7-22). 

The wave tensor e 46 has the remarkable property that it is invariant for the 
ten rotations in five dimensions (kinematic rotations). This is easily verified, 
remembering that by the above formulae it is unaltered by transformations 
with real matrices with which it commutes or imaginary matrices with 
which it anticommutes. Thus for all orientations of the axes in five dimen- 
sions * 



No other strain vector has this property. It may be compared with the 
metrical tensor g^ = 8^" and the contravariant tensor density * V(rr in ordinary 
tensor calculus, which are likewise exceptional in having invariant values. 
In ordinary tensor calculus we define associated covariant and contra- 
variant vectors A^ A* by the relation Ap^g^A*. We shall now define 
associated (initial) covariant and contravariant wave vectors 0*, #*. A 
linear relation between them must be of the form 



t The left-side gives the transformation -> cfo M followed by the transformation -> x^ . 
By definition this takes us to the point # M +<fo/ . We cannot immediately identify it with the 
direct transformation -> x^+dx^', since an infinitesimal time-like matrix may be included. 



104 Wave-tensor Calculus [7-6 

where a^ is a covariant wave tensor. Consider the special case a^p = i (e 45 ) a . 
Then ^^t'x* (45)00 by (7-61); or, dropping suffixes, 

f = *y 45 . (7-62) 

Wave vectors connected by the relation (762) will be called associated wave 
vectors. 

Thus in wave-tensor calculus, (iE^)^ plays the part of g^p in the operation 
of lowering a suffix. f Since (i# 45 ) 2 =l, we have also x* = ^*^?45, which 
defines the operation of raising a suffix. Since four-point matrices are used, 
raising or lowering a suffix is a rather simple process; for example, in the 
standard pentad (3*27), 

J 45 = 0010 

0001 

-1000 

0-100 

Hence &,**,**, #4 = *' (* 3 > X 4 , " X\ ~ X 2 )- (7-63) 

Now multiply (7-62) by a covariant wave vector ^ by outer multiplication. 
We have * 



Or, denoting the covariant wave tensor 0<* by S, and the mixed wave 
tensor fc* by T, !3=iTE K . (7-64) 

We have therefore the important result: 

If a space vector T is multiplied by iJ5? 45 we obtain an associated strain vector 
S. Reciprocally, if a strain vector 8 is multiplied by iJE7 45 we obtain an associated 
space vector T. 

The components of S are obtained by shuffling the components of T, and 
in some cases inserting factors i\ the process is thus somewhat analogous 
to (7-63). We have already given the precise relation between the s^ and ^ 
in (4-66). 

The relation (7-64) is invariant for kinematical rotations, but electrical 
rotations ( 6-4) are excluded. It would be undesirable to exclude electrical 
rotations of a strain vector; because a strain vector is used primarily in 
connection with phase space, and some of the directions of displacement in 
phase space correspond to electrical matrices. But we have suggested that 
the term "space vector" implies that only kinematical rotations are con- 
templated ( 4'3). It would appear therefore that the strain vector is the 
more fundamental conception, and that space vectors are a derivative 
conception introduced into physics by the formula (7-64). That is to say, 
we take 8 to be a covariant wave tensor for all transformations; then the 

t If we could depend on the suffixes appearing explicitly in the formulae, we should 
naturally use the notation x", # a for the two associated wave vectors; but since suffixes are 
generally omitted, we have to distinguish them by different letters x> <f>. 



7-6] Strain Vectors and Phase Space 105 

matrix T determined from it by (7-64) will behave as a mixed wave tensor 
for kinematical rotations but not for electrical rotations; so that it will be 
rigorously a space vector, but only imperfectly a mixed wave tensor. If all 
space vectors in physics originate out of strain vectors in this way, we see 
why there is never any occasion to employ in practice the extra relativity 
transformations possessed by space vectors derived from mixed wave 
tensors. We have previously attributed the absence of these transformations 
to the neutrality of space-time. The two explanations are connected; because 
a system composed of particles of one sign could have no equilibrium con- 
figuration, and give no foothold for statistical mechanics with its attendant 
conception of phase space. 

Since (7-64) is the same equation as (4-65), S is the three-dimensional 
density of T\ and we reach our earlier definition of a strain vector as the 
three-dimensional density of a space vector. The relation is reciprocal, and 
T is also the three-dimensional density of 8. But the discussion in 4-6 was 
limited to an infinitesimal region round the origin, where the volume element 
of the 3-space was dW J2 ^ = iE^dw. If we move away from the origin, the 
matrix of dfF 123 will no longer be the original ,645. In addition to the change 
of direction of the normal to space-time, we must allow for an arbitrary 
rotation of the time direction, since there is no absolute way of defining the 
reckoning of simultaneity over an extended area. The general formula for 
the volume element is dW 123 =i(e u )J>dw, (7-65) 

where (e 45 ) a ^ is the space vector which has the value E^ in the original co- 
ordinate system at the origin. Hence the three-dimensional density of T is 
iT (^g)/, whereas the strain vector is iTE^^iT (e 45 ) aj3 . 

We see therefore that the elementary definition of a strain vector as the 
three-dimensional density of a space vector, and vice versa, does not hold for 
an extended curved region. It can, however, be preserved if we represent 
the element of volume as a strain vector with matrix (e 45 ) a instead of in the 
more familiar way as a space vector with matrix (e 45 ) a ^. To regularise this 
we distinguish an internal and an external three-dimensional space. They 
consist of the same points; a particle which is in one is in the other; but the 
metrical conceptions are different. The internal space is part of phase space, 
corresponding to three of its ten dimensions. We have seen (7-421) that the 
whole volume element is scalar; the separation of the three and the seven 
dimensions is so drawn that each combination is a strain vector with matrix 
JS7 45 , constant throughout phase space, and therefore in a sense characteristic 
of the whole phase space. 

The phase space, including the internal space, is described wholly by 
strain vectors. It will be remembered that the matrix d@ 8 , determining 
displacement in it, was found to be a strain vector ( 7-3). But when the 



106 Wave-tensor Calculus [7-7 

three dimensions are separated from the others, the densities or fluxes of 
strain vectors with respect to the three-dimensional volumes constitute 
space vectors. These require for their representation an external space. 

7*7. Normalised Strain Vectors. 

We shall now transform the wave equation so that it applies to strain vectors. 
The equations giving the factors iff, x* f a space vector are 

/fy = 0, X *ff = 0, (7-71) 

where H = E^Pi + E 26 p 2 + E 3b p 3 + E^p^ - m, by (6-51). Let 

H 8 = - EM H = Eu Pl + E 2 tp 2 + E 3 tp 3 +pt + E^m. (7-72) 
Then (7-71) becomes E^H 8 ^ = 0, x* E *& H 8 = > 
so that #0 = 0, <*# 8 = 0, (7-73) 

where <* = *x*^45 by (7-62). These are the wave equations for the factorisa- 
tion of a strain vector ^*. The hamiltonian H s is part of 0^*. If p^ ipo, 



we have . (7-74) 



This satisfies the reality conditions for a strain vector ( 6-5 (3)), the coeffi- 
cients in (7-74) all being real. The energy p Q is associated with the algebraic 
matrix J57 16 ; and the coordinate conjugate to it, namely the time t, must also 
be associated with E 1Q . Accordingly in phase space the algebraic phase 
represents the time. 

Since the phase space corresponds to the internal configurations of 
a system, the time 16 is measured in the direction defined by the external 
momentum vector ( 7-1). 

In phase space the algebraic phase may be separated from the other phases, 
leaving a nine-dimensional space. This is permissible, because the algebraic 
phase commutes with all the others. If dco c is the nine-dimensional volume 
element and d0 16 the algebraic phase, so that 

da> = daj c d6 u , (7-751) 

we can treat d0 16 separately in integrating; so that, if Q is the volume of the 
ten-dimensional phase space, and ti c the volume of the nine-dimensional 

s P ace ' a=Q c .27T. (7-752) 

In defining phase space we associated a strain vector 8 with each point. 
We shall now more definitely associate a strain vector S = Sdo> c /Q c with a 
range of configurations rfo> c . This strain vector will serve three purposes: 

(1) Its algebraic phase indicates the time. 

(2) Its symbolic phases describe a particular configuration. 

(3) Its amplitude indicates the probability of the system having this 
configuration to within a range do> c . 



7-7] Strain Vectors and Phase Space 107 

This interpretation applies differentially to small changes of , but it is 
not so easy to interpret S itself. For the latter purpose we must introduce 
the determinant of S. Since det S is invariant for all non-algebraic displace- 
ments in phase space ( 3-6) it is independent of (2). For a purely algebraic 
strain vector, detS = S 4 . If then we take as "origin" the configuration for 
which the strain vector is algebraic (regarding this as the standard un- 
strained condition), we have 

(1) The time is the argument of the complex number (det Z)*. 

(2) The configuration is specified by the matrix S/(detS)*. 

(3) The probability is the modulus | (det S) | . 

It is a most important feature of the symbolic theory that tlie same 
symbol specifies both the configuration and the probability that the system has 
that configuration. 

The probability distribution which we contemplate initially is uniform 
throughout phase space, so that an element dcu c contains probability dco c /l c . 
Hence for the initial probability | (det S)l \ = 1. In particular, at the origin 
(standard unstrained configuration) S=l. This initial or a priori distribution 
is the framework the "blank sheet" into which we insert whatever we 
may learn about the system by special observation. If the observational 
evidence shows that at a time t certain configurations were more probable 
than others, we inscribe on the blank sheet a function /(#i, 0%, ... t) in- 
dicating that the actual probability of the configuration (6 lt 2 , ...) was/ 
times the initial probability. We call / the modifying factor. The modified, 
i.e. the actual, distribution is therefore represented by a strain vector S 
which does not in general satisfy | (det Sft | = 1. The factor / is necessarily 
an algebraic function. In wave analysis it is expressed as the product of two 
wave functions, and is therefore formally a wave tensor of the second rank 
in which all terms are zero except the quarterspur. This suggests a 
generalisation of the modifying factor /. It often happens that in intro- 
ducing the modified probability we make at the same time a transformation 
of coordinates; that is to say, we compare the modified probability of 
a configuration with the initial probability, not of the same configuration, 
but of a configuration related to it by a transformation. The trans- 
formation and the modification of probability are comprised in a non- 
algebraic modifying factor/, which is the product of vector wave functions. 

Whether we are treating the initial or the modified distribution, its strain 
vector is normalised so that the total probability in the volume ii c is 1. In 
this normalisation the time dimension 16 is excluded, because the con- 
ception of distributing probability over extension in time is rather unusual. 
An atom exists continuously in time, so that the association of its pro- 
bability with particular time intervals dt does not arise in a direct way. 



108 Wave-tensor Calculus [7-7 

But consider a clock with one hand moving at a uniform rate. If the clock 
is part of the system, we shall, in specifying the configuration of the system, 
specify the position angle 16 of the hand of the clock; and the elementary 
range of configurations dot will include the element d0 16 of position angle. 
Thus from the internal point of view 16 is an ordinary angular coordinate 
in the specification of configuration, and is one of the dimensions of the 
domain over which the unit probability is distributed. It may be included 
in the general normalisation; or (owing to its commutation with all the other 
phases) it may be excluded and normalised separately, so that the pro- 
bability d0 16 /2?r of a range d0 16 is stated separately. 

It was rather surprising to find time appearing at all in phase space. We 
apparently rid ourselves of it when we retained only the displacements 
associated with space-like matrices. But it has gained entry in space-like 
disguise as the position angle of the hand of a clock. Moreover, it is a periodic 
angular coordinate, not an open infinite coordinate like external time. 

This may be made clearer by an illustration from celestial mechanics. 
The orbit of a planet is specified by six elements, one of which is the epoch 
of perihelion passage T. The element T corresponds to 16 . If there are 
several planets, we must include the element T for each of them in enumer- 
ating the possible systems which might be formed. Thus T is an essential 
coordinate of the configuration space in which we represent the possible 
combinations, although it is ordinarily conceived as having a time-like 
character. 

Since time is measured by a phase angle, instants which differ by multiples 
of a period are to be considered identical; and the whole extent of time is 
27r.fi in natural linear coordinates. That is because in the structure assigned 
to the system there is no provision for a revolution counter. A more extended 
time reckoning can only be given a meaning when we treat more complicated 
systems. It is fairly obvious that infinite time will appear automatically 
when we introduce systems with incommensurable periods. 

7*8. Physical Meaning of Strain Vectors. 

By the aid of space vectors we have defined a domain which has the primitive 
property of conceptual space-time, namely that all points of it are equiva- 
lent. It is all one whether a given object is at the point A or at the point B. 
But this is in flagrant contradiction to our experience that somehow it is 
possible to find out that the object is at A, not B. We have to combine two 
different conceptions of position absolute and relative. By absolute I mean 
4 'conceived as absolute", i.e. pictured in an abstract geometrical frame; by 
relative I mean "relative to observable landmarks". In so far as the points 
of space-time symbolise absolute positions they are equivalent to one 
another; in so far as they symbolise relative positions they can be dis- 



7-8] Strain Vectors and Phase Space 109 

criminated observationally. According to whether the point is being con- 
sidered in its absolute role or its relative role, the position vector defining it 
is a space vector or a strain vector. 

The same applies to other physical entities. The energy and momentum 
constitute a vector. Referred to an abstract geometrical frame, this is a 
space vector, but relative to another physical system it is a strain vector. 
The difference between space vectors and strain vectors might be defined 
by saying that when we treat of space vectors we are contemplating the 
" blank sheet"; when we treat of strain vectors we are beginning to write 
something on the clean page. 

It is true that when we contemplate two space vectors we are implicitly 
writing something on the clean page, namely the internal relations of a 
system consisting of two parts. But the treatment of two vectors, and the 
extraction of the internal relations of that which they represent, is slightly 
more advanced than the problem we are now handling, and is not suitable 
for our first writing lesson. We here treat the internal relations as already 
extracted and presented to us in the form of a strain vector. 

The conception of relative position or relative momentum arises when, 
instead of contemplating the particle as a solitary system in an abstract 
frame, we regard it as part of a more comprehensive physical system. 
Consider, for example, an electron in an atom. A displacement of the 
electron has two aspects. It is a translation of the electron from one point 
to another in external space-time; as such it is a change of the space 
vector defining the absolute position of the electron. But it is also a 
deformation or strain of the atom; as such it is a change of a strain 
vector. The atom is changed to a new configuration, so that there is a 
displacement in the phase space in which the configurations of the atom 
are represented. 

The tensor calculus provides a machinery for locking the changes of one 
characteristic to those of another. The changes of strain of the system are 
locked to the changes of absolute position of its parts. We should commonly 
say that the change of position of the electron is the cause of the change of 
strain of the atom. To work out this, connection in an actual system of 
several particles is too complex a problem for us at present. We proceed in 
the converse way. We consider the simplest form of tensor interlocking, and 
construct the ideal "system " to which it would apply. We cannot find any 
more elementary starting point thaji the locking of a covariant wave tensor 
to a mixed wave tensor; and the former has been called a strain vector in 
anticipation of this application. 

Accordingly when we wish to pass from the absolute to the relative 
aspect of the position of a particle, we treat the particle as an element of an 
ideal physical system whose configurations (so far as they are determined by 



110 Wave-tensor Calculus [7-8 

this particle) are specified by a simple strain vector, and therefore occupy 
a phase space of the kind we have been investigating. 

What exactly has been added to the particle to make it part of a physical 
system instead of a lone particle to de-absolutise it? Mathematically it is 
the matrix J? 45 , which transforms its space vector into its strain vector. This 
matrix defines a particular three-dimensional section of the world a plane 
of simultaneity. Physically the particle is made part of a system by asso- 
ciating with it a plane of simultaneity. 

We have seen that when a number of particles are treated as a combined 
system, each particle retains its separate space coordinates, but there is only 
one time coordinate for the whole system. This replacement of the in- 
dividual time coordinates by a common time coordinate is the essence of the 
process of combining ; it defines the change in our point of view when we 
consider the system as a whole instead of its constituent parts. A hydrogen 
atom is composed of a proton and electron; but a proton today and an elec- 
tron yesterday do not constitute a hydrogen atom. We have seen ( 7*1) that 
the planes of simultaneity which correspond to the common time coordinate 
of the system are determined by the direction of its external momentum 
vector. 

We can now see how the conversion of space vectors into strain vectors 
corresponds to our change of attitude when we consider the particle to be 
part of a system. Actually to introduce the other particles of the system 
would greatly complicate the problem; but, in anticipation of their presence, 
we introduce the planes of simultaneity which will be determined by the 
external momentum vector of the system when it appears. We construct 
these planes in the geometrical space-time which previously contained no 
indication of a particular direction of section; we inscribe the matrix J5 45 , 
henceforth to be permanently associated with the planes of simultaneity, 
on the sheet which was previously blank. Simultaneity is no longer arbitrary ; 
we cannot modify the reckoning of it to suit a particular particle of the 
system. Suppose that the particle has a momentum in the ^-direction; by 
(6-51) the space vector representing this is E l& p l . Formerly the particle 
could be "reduced to rest" by a Lorentz transformation with matrix of the 
form J5? 14 itt 14 ; but we are no longer allowed to rotate the plane of simul- 
taneity. The strain of the system (as compared with a system in which the 
particle is at rest) is measured by the inhibited rotation E^iu^ . Apart from 
a numerical factor this is the strain vector E l5 p l .iE^ 5 = E 1 ^ip l associated 
with the space vector E-^p^ 

This example calls attention to another feature of the connection between 
internal and external space. The external velocity or momentum vectors 
correspond to internal displacement vectors, and vice versa. At present we 
can only recognise this in a preliminary way; but another example may be 



7-8] Strain Vectors and Phase Space 111 

of interest. If we apply this principle to the internal energy attached to the 
matrix E 19 , the corresponding space vector attached to the matrix 7S 45 
should be interpretable as an external time displacement. As in the previous 
example it will be an inhibited or virtual time displacement. If we change 
the scale of the system, we change the light-times between its various parts, 
and therefore change the time at which a particle becomes causally effective 
at the centroid where the external wave vector is supposed to be located. 
But in our internal wave functions the particles are assigned simultaneous 
times, not the times at which they are causally effective. Thus the distance 
of a particle from the centroid can be looked upon as an inhibited time displace- 
ment. As an isolated particle we should have contemplated it at an ante- 
dated instant to allow for the lag of causal efficiency; as part of a system we 
can only contemplate it at the same instant as the rest of the system. If it 
receives a radial displacement 8r, a further lag 8rjc is introduced which (if the 
particle were not considered to be part of the system) would be compensated 
by giving it a time displacement 8r/c. Since the planes of simultaneity in 
the system are fixed, we cannot give an individual time displacement to the 
particle; and the inhibited time displacement appears as, and is the measure 
of, a strain of the system. We have seen that this strain will be associated 
with the matrix jE7 16 and therefore be an internal energy. Thus a system will 
in general have an internal energy depending on its linear scale. This is one 
aspect of the origin of electrostatic energy .f 

Our present point of view is that, instead of starting with an elementary 
particle defined to be such that its properties can be represented by a com- 
plete space vector, it is rather less abstract to start with an elementary 
system defined to be such that its condition or "configuration" can be 
represented by a simple strain vector. Then by introducing the associated 
space vector we detach the active principle of the system from the passive 
principle represented by planes of simultaneity, and so obtain a still more 
abstract entity, namely an elementary particle in free space. It may seem 
far-fetched to describe a particle coupled with a plane of simultaneity as a 
"physical system". But that is as much of a physical system as we can 
represent by a simple wave tensor. And inasmuch as current quantum 
theory has made shift to treat a wide range of problems of observational 
importance with simple wave tensors, it is an important stage in the advance 
of the theory towards actuality. 

If a system A were completely isolated it would be unobservable. There- 
fore it enters into observation as part of a more extended system B. As such 
it will be represented in the internal space of B. The external momentum 

t Conversely, if we derive the electrostatic energy as in Chapter xv, the foregoing 
investigation shows how to connect with it the idea of antedating the particles by the 
light-time. 



112 Wave-tensor Calculus [7-9 

vector of B determines planes of simultaneity for the internal constituents 
of B\ and the external momentum vector of A, which was a space vector so 
long as A was considered in isolation, is replaced by the associated strain 
vector when A is considered as part of B. But B in its turn would be un- 
observable if it were not part of a larger system C\ its external momentum 
vector must therefore be replaced by a strain vector, representing B as a 
constituent of the internal structure of C. And so on. 

At each stage there is an external space vector of the system contemplated, 
which is converted into a strain vector when we consider a larger system 
until the system has been extended to comprise the whole universe, when 
the external vector can be dropped, all phenomena being comprehended in 
the internal structure. 

From this point of view space vectors make only a temporary appearance 
when we halt for breath in the course of the analysis. As soon as we are 
ready to proceed further, we replace them by their associated strain vectors. 
There are, however, two reasons why space vectors remain an important 
conception in practical investigation. Firstly, it is impracticable to treat 
the exact equations of the internal statew of highly complex systems; 
therefore when we have to deal with'a number of systems with weak inter- 
action, we leave them uncombined and treat their mutual influence (which 
brings them within reach of observation) by approximate perturbation 
methods. Secondly, in dealing with a large number of similar systems, we 
combine them statistically, not individually. The external space vectors are 
not transformed individually into internal strain vectors of the complex 
system; but are first replaced by a probability or average distribution. The 
internal state of the complex system is described by a very much simplified 
set of strain vectors embodying coefficients of the distribution function. 
Thus the internal state of a gas is described by pressure, energy density, 
virial, vorticity, etc., representing certain averaged characteristics of the 
external space vectors of the molecules. Normally the direct procedure of 
replacing space vectors by their associated strain vectors is not extended 
to systems greater than a molecule.f 

7-9. Singular Phase Space. 

The phase space which we have been considering is generated by the trans- 
formations of a non-singular strain vector. If we use instead a singular 
strain vector S a , the resulting phase space has fewer dimensions, since 
S a dQ 8 s=Q when d 8 is a pseudo-reciprocal of S a . Displacement in such a 
direction involves no change of the strain vector and therefore no change of 

t The averaging could perhaps equally well be performed on the associated strain 
vectors. But since space vectors are more familiar we generally adhere to them as long as 
we can. 



7-9] Strain Vectors and Phase Space 113 

configuration; the direction accordingly is not a dimension of the phase 
space of S a . 

To show the relation between singular and non-singular phase space, we 
consider a non-singular strain vector which at some point of phase space has 
a purely algebraic value S . By (6-64) this is analysed into spectral coiu- 

P nentS = S . (7-91) 



The components are pure and therefore singular. Consider the component 
S a . Since the products S a 8 b , etc. are zero, the singular directions, which 
satisfy S a d& 8 = 0, are ^ = 6/g& + ^ + dSd (7 . 92) 

Since d 8 cannot contain time-like matrices, the ratio b:c:d must be chosen 
so as to eliminate the time-like matrices J5? 23 , J57 45 . There is therefore only 
one singular direction d&.=c(8 + 8 d ). (7-93) 

If, as in (6-64), S f , = - iiS (#1 + # 23 + #45 + # 16 ), (7-941) 

the singular direction is d 8 =(-E l + # 16 ) d<f>. (7-942) 

For S c and S d the singular direction is d& 8 = (E l -f E 19 ) d<f). 

Non-singular phase space of ten dimensions can thus be regarded as a 
superposition of two singular phase spaces of opposite spin, each of nine 
dimensions. The axis of spin, here represented by E t , can be in any direction 
in three-dimensional space. When the singular phase space of 8 a is delineated 
in ordinary phase space, the added dimension (direction of </>) gives a line of 
indistinguishable configurations which are really one configuration. The 
equation of the singular line (7-942) may be written 

d<f> = cZ0j_ = dd^Q . 

Or, since dt = Rd6 IB , dOJdt = - I/ R. 

The singular line therefore represents an entity spinning with uniform 
velocity. We do not distinguish the different orientations corresponding to 
the sequence of points along the line, but count the state of spin as one con- 
figuration a constant state of strain of the system. The strain produced by 
the rotation is to be compared with the gyrostatic torque of a fly-wheel, not 
with the torque of a wound-up spring. A change of the plane of spin would 
be a change of configuration. 

When we separate non-singular phase space into two singular phase 
spaces, we require an additional variable to specify how the probability is 
divided between the two phase spaces. This extra variable compensates for 
the loss of a dimension in passing to singular phase space. It is to be remem- 
bered that S a + S c does not represent two particles of opposite spin; two 
particles would require a double wave tensor. It represents a particle which 
has equal probability of having either spin. 



114 Wave-tensor Calculus [7-9 

It is perhaps rather surprising that there is no corresponding separation 
of phase space for positive and negative charges. The physical reason appears 
to be that the conception of phase space is bound up with the conception of 
statistical equilibrium, and there can be no equilibrium if the charges (or 
the probabilities of positive and negative charges) do not balance. Equi- 
valently it is attributable to our result that electrically saturated space 
satisfies different reality conditions; and the phase space (if any) associated 
with it would require to be reinvestigated from the beginning. Although 
there is considerable analogy between opposite sign of charge and opposite 
spin, there is the fundamental distinction that opposite charges, unlike 
opposite spins, cannot be transformed into one another by relativity 
rotations. 



CHAPTER VIII 
THE DIFFERENTIAL WAVE EQUATION 

8* 1 . Conservation of Probability. 

In the terminology introduced by Diracf a "state" of a system consists of 
a particular distribution of probability over the various possible configura- 
tions. The state is supposed to extend over all time. The probability dis- 
tribution may vary with , but always so that the integrated probability of 
all the configurations for a fixed value of t is unity. In other words pro- 
bability is conserved. 

We may treat the probability as a fluid occupying the configuration space, 
the probability of a given range of configurations being represented by the 
mass of fluid in a corresponding volume. The change of distribution of 
probability is then represented by a motion of the fluid. 

The method of wave mechanics is to analyse the whole probability of the 
system, which must be unity, into the probabilities p a , p b , p c , . . . of a set of 
elementary states a, b, c, .... Then if ^(a^, t) is the probability of the con- 
figuration Xp at time t in the state a, the whole probability of the configura- 
tion X at time t is 



A. perturbation of the system means a variation ofp a , p b9 p c , ..., subject to 
their sum remaining unity. 

The object of this device is to separate the mathematics of interaction 
from the mathematics of structure; for the influence of extraneous bodies is 
described by changes of the factors p, and the functions q which describe 
the structures of the various states remain unaffected. 

Since the elementary states are introduced for analytical purposes, we 
may impose on them such limitations as appear advantageous. It would be 
possible to contemplate discontinuous flow of probability, whereby pro- 
bability is created at one point and disappears at another point in the state, 
subject to the total probability remaining constant; but in current quantum 
theory the states are assumed to be continuous, that is to say the fluid moves 
subject to the equation of continuity. 

We introduce a vector j^ whose component in the time direction gives the 
density of the probability fluid and whose space components give the 
density of its flux. The conservation of probability is then secured by the 
equation of continuity of the fluid 

div^-0. (8-12) 

t Quantum Mechanics, 1st ed. 



116 Wave-tensor Calculus [8-1 

This assumes that the configurations are represented in a space with Eucli- 
dean metric. More generally the density of probability and of its flux are 
represented by a vector density j^, and the equation of continuity is 

divj^O, (8-13) 

the covariant divergence of a vector density being the same as its ordinary 
divergence. 

For the general development of the theory it is necessary to use the phase 
space and strain vectors treated in Chapter vn; but in order to connect our 
treatment with Dirac's theory, we first consider a simple particle whose 
"configurations " are determined by its coordinates x, y, z in ordinary space. 
The probability distribution in state a is then a function q a (x, y, z, t). 

Since we admit perturbations, we have to recognise an independent 
variable s extraneous to the state as an argument for the perturbations, so 
that (8-11) becomes I^p. <).*. (a, y.M). <*14) 

The distinction between s and t is often ignored; but it is necessary to attend 
to it when we consider Lorentz transformations. We may apply a Lorentz 
transformation to x, y, z, t, and thus obtain a new but equivalent description 
of the probability distribution in the state; but the function p a (s) is a factor 
applying to the whole domain of x, y, z, and the time s which appears in it is 
not associated with any particular values of x, y, z; there is therefore no 
possibility of applying a Lorentz transformation to s, and it is invariant 
for the internal transformations of the state. 

The perturbations are imposed on the system from without, and occur at 
times fixed according to an extraneous time reckoning. If we like, we may 
choose the time t within the state so as to conform to this reckoning; but we 
then lose the possibility of applying Lorentz transformations to the state. 
It is undesirable to do this at any rate at the present stage since the 
study of the Lorentz transformations, initiated by Dirac, has contributed 
greatly to our understanding of the theory. To preserve this advance and 
exhibit it in its proper relation to the more general developments, we have 
to distinguish a relative time t in the state and an invariant time s which 
serves as a link with other systems.! 

If we wish to make a relativity transformation which includes both the 
perturbed and the perturbing system, we must treat them as one combined 
system and analyse the probability distribution into elementary states of 
the combined system. But then, for this combined system, there will be an 
invariant time 8 forming the argument of the perturbations of the combined 
system by systems extraneous to it. However many systems we combine 
there is always an invariant time left over for the purpose of representing 

t In the nomenclature introduced later in this chapter, t is a geometrical coordinate and 
a a dynamical coordinate. 



8-1] The Differential Wave Equation 117 

the perturbation effects of systems not yet included. It would be idle to 
consider a system without making provision for perturbation from outside; 
for in that case we could never acquire knowledge of it unless indeed it 
were so extensive as to include the brain of an observer. 

We note in particular that the time t is reversible, but the time s is irre- 
versible. The axes in the state can be rotated so that ->- 1, and there is no 
absolute distinction between waves travelling forward and waves travelling 
backward in t. The latter are oddly said to have negative energy or mass 
according to the technical definitions in quantum theory. But the inter- 
action with other systems depends on the invariant time s\ thus the rotation 
of t and the substitution of "negative mass" for positive mass does not 
signify an observable change it would not be a relativity transformation 
if it did. Similar considerations apply to more complex systems, until we 
reach a system which includes a human brain and is observed from within 
instead of from without. Extraneous time 8 is then no longer needed. But 
the inclusion of the observer in the system automatically prevents the 
relativity transformation from being pushed so far as to turn t into - 1\ at 
least it is unusual to include among the admissible systems of description 
(1-1) those of an observer whose consciousness runs backward in the 
adopted time reckoning, and who endeavours to predict the past from his 
memories of the future. 

By (8-14) the probability of a configuration x^ is a function of the two 
times s, t. The same duality of time occurs, for example, in perturbation 
theory in celestial mechanics; at each moment s there is an osculating orbit 
of a planet which professes to give the position of the planet for all times t 
between - oo and + oo. These positions are not actually realised. To obtain 
the realised positions we have to associate corresponding values of s and t. 
This means that in the present problem we have to lay down in the state a 
series of locif(x,y,z,t)=s which give a "representation" of s in the state. 
In particular cases it could be arranged that f(x,y,z 9 t) = t = 8', but, as 
already stated, this definite choice of t requires that we forgo the applica- 
tion of Lorentz transformations. 

The coordinate s, which is primarily.an extraneous time-variable but has 
also a representation in the state, is a connecting link between the system 
which is being described and the rest of the universe. This ideal type of 
connection cannot be exactly realised in practice. The approximation lies 
in the assumption that, whatever the source of the perturbation, its argu- 
ment s is represented by the same series of loci/ (x, y, z, t) in the state. Prac- 
tically this means that the velocity of propagation of all perturbations is 
assumed to be infinite. If we require greater accuracy, the only course is to 
amalgamate the perturbed and perturbing system and investigate the 
states of the combined system. 



118 Wave-tensor Calculus [8-1 

To insist on this greater accuracy would be a counsel of despair. It would 
be to tell the physicist that, since the whole universe is interrelated, it is no 
use his attempting to study anything less than the whole universe. It is 
obvious that it is profitable to study portions of the universe as isolated 
systems. The method of states provides for this, and at the same time makes 
provision for re-attaching these systems to the rest of the universe in an 
approximate way, so that they may not be entirely cut off from observation. 

It would be possible to choose the set of elementary states a, b, c, ... in 
many different ways. In practice therefore certain limitations are imposed. 
In particular the states are so chosen that the wave tensor SJ2^ j^ corre- 
sponding to the probability vector is factorisable. The state is then said to 
be pure. Further, the whole set of states must be complete without being 
redundant so that, when the probability distribution of the configurations 
at time t is given, the coefficients p n , p b , ... are uniquely determined by it. 
The conditions of purity, completeness and non-redundancy require that 
the states shall correspond to a spectral set of operators. The spectral 
set is, of course, much more general than that which we have introduced for 
resolving an isolated wave tensor ( 5-7). We shall not enter into the detailed 
treatment of these conditions, since our theory here coalesces with current 
quantum theory. 

8* 2 . The Divergence Condition . 

For a system describable by simple wave tensors the probability vector j^ 
representing a pure state will be of the form 



We introduce two divergence operators 

16 

V-S^a/eteg, V 
where 8/80;^ signifies 3/9^ written after its operand. Then sincej" = 



16 16 

S^a/eteg, V* = 2(8/8^)^, (8-21) 



so that, summing for /A= 1, 2, ..., 16, 

divj = S (dj^dxj = - Jx* ( v * + v ) 0- (8-22) 

It is understood that the same frame E^ is used throughout the whole 
domain of x^ in which and x* extend. 

The result of performing the operation V on $ is another four-valued 
quantity o>. We can always find a matrix M such that co = M ift. Generally 
M will be a function of the coordinates. We therefore set 

Vt = Mif>, x*V* = x*M*, (8-23) 

so that div j = - ** (M* + M) 0. (8-24) 



8-2] The Differential Wave Equation 119 

We ensure the vanishing of divj by setting Jf*= If; so that (8-23) 
gives the differential wave equations 

(V-Jf)^ = 0, x *(-V*-Jlf) = 0. (8 . 25 ) 

These wave equations will be invariant for all wave-tensor transforma- 
tions, if V, V* and M are mixed wave tensors. This requires that 3/3a^ shall 
be a complete space vector; that is to say, when a relativity transformation 
is applied, it is transformed into a new array of operators 3/3^' which are 
linear functions of d/dx^ , just as if it were a numerical vector. The condition 
that 3/Sa^ is a complete space vector is equivalent to the condition that x^ 
is a complete space vector. 

If we limit ourselves to solutions of (8-25) which are functions of four 
rectangular coordinates (x l9 x 2 ,x 3 , # 4 ), the equations reduce to 



s <8 ' 261 > 



These equations are invariant only for the six relativity rotations in four 
dimensions; because the other relativity transformations (applicable to 
(8*25)) would reintroduce the terms that have been dropped. 

Dirac's wave equations for an electron of proper mass m in an electro- 
magnetic field, whichgivesit an energy and momentum K^ , can be written as 



(8-262) 
Comparing (8-261) and (8-262), we obtain the identification 



(8-271) 
Writing as usual M = S m^ E^ , we have 

m le = m (wij , w 2 , w 3 , w 4 ) = - i fa , K Z , /c 3 , * 4 ). (8-272) 

We have made this comparison with Dirac's equation in order to ascertain 
the current nomenclature for the components of our space vector M. The 
consequences of his equation have been worked out and compared with 
experiment, so that we know how the quantities m, K^ contained in it 
manifest themselves observationally. Equation (8-272) transfers this 
knowledge and the consequent nomenclature to our own equations (8-261). 
It would be foreign to our plan to intermingle the current semi-empirical 
theory with the purely deductive theory that we are developing, and our 
reference to the current equations is for identification purposes only. In 
current theory m is supposed to be a constant independent of the coordinates. 



120 Wave-tensor Calculus [8-2 

We have as yet shown no reason why m should be constant; that is a con- 
sequence of the dynamical equations, and will be proved in 9-2; but it will 
be assumed here in anticipation. 

If ^-iJL + ^i^.^, (8-28) 

(8-262) reduces to 



which formally agrees with the simple wave equation as given in (5-13), 
(5-14). But this is only a real agreement if (8-28) gives an algebraic value of 
Ppl for in (5-14) the p^ are necessarily algebraic coefficients. 

In order that the p^ defined by (8-28) may be algebraic, and x* must be 
eigensymbols of d/dx^ . This requires that shall be a function of x^ of the 

form Jf(x x x. x ) = /(# x x x ) Ji (8-285) 

where / is an algebraic function and a constant wave vector. We shall 
call wave functions of the form (8-285) algebraic wave functions. 

It would be undesirable to exclude non-algebraic wave functions. We 
shall find, for example, that the wave functions giving the steady states of a 
hydrogen atom are non-algebraic. Thus for unrestricted wave functions the 
momenta p^ are non-algebraic ; and the identification of p with the algebraic 
coefficients^ in 5-3 does not apply. 

Thus in general we have two independent equations 

= 0, (8-291) 

= 0. (8-292) 

The first is an identity, except that it is implied that the axes are so chosen 
that J 5 = 0. The second expresses the conservation of probability. For 
algebraic wave functions the two equations are the same; so that the 
momentum vector is the same as the stream vector except for a numerical 
factor. For non-algebraic wave functions the two equations are distinct; 
and no comparison between the stream vector and momentum vector is 
possible, since the components of the latter are non-algebraic quantities. 
We can rewrite (8-292), setting 

4 16 



where the p ' are algebraic coefficients; then the momentum vector can be 
regarded as a complete space vector p^', which has been formally reduced to 
four dimensions by the device of admitting symbolic components. 

If there is no electromagnetic field, (8-262) has the elementary solutions 

^ (8-293) 



8-3] The Differential Wave Equation 121 

where the p^ are algebraic, and ^ , Xo* are solutions of the elementary wave 
equations (5-13). 

In this book we shall use non-algebraic wave functions in special in- 
vestigations, but in pursuing the more fundamental problems we shall 
generally limit ourselves to algebraic wave functions. That is because we are 
especially concerned with the borderland of relativity and quantum theory, 
and, as explained in the introduction, their meeting-point is to be found in 
the most uniform conditions. 

According to (8-28) there are two different formulae for y^ . At present the 
position (final or initial) of the wave vector is a sufficient indication which 
formula applies. But it is hampering to have a definition of p which pre- 
vents us from changing the position of the wave vector; and we shall later 
describe the difference of the two operators in a more general way ( 8-6). 

8- 3 . Govariant Differentiation . 

We do not propose to generalise our formulae to apply to all kinds of curvi- 
linear coordinates; but there is one kind of curvilinear coordinate which it 
will be necessary to use, viz. an angular coordinate. Consider the trans- 
formation from rectangular coordinates (x l9 x 2 ) to polar coordinates (r, 12 ) 
in a plane; the question arises how this change is supposed to affect a general 
space vector jL and its wave- vector factors ^r, x*. 

Our theory of has been based on an orthogonal frame of reference, and 
it would be a grave complication to depart from an orthogonal frame. 
Therefore we do not consider the polar components of a vector in the sense 
of general relativity, but rather in the sense of elementary mechanics in 
which the "polar components" of a force are its rectangular components in 
the radial and transverse directions.! That is to say, our transformation 
will correspond to (dx^ , dx 2 ) -> (dr, rd0 12 ), not to (dx^ , dx 2 ) -> (dr, d0 12 ). Thus 
we retain an orthogonal frame, but the frame rotates as 12 changes; and a 
local vector 0x* * s resolved orthogonally in continually changing directions. 

We shall now find the covariant derivative of iff with respect to 12 . By 
(4-15) the transformation $' = C* AI ^" gives J/ = ^ cos 12 - 1 2 sin 12 > which 
is the change of t due to the axes having been rotated through an angle 
12 . In the present case, when the angular coordinate changes from to 
12 , the axes are rotated in the same direction so that the corresponding 
transformation of is y = e -+* u e^ m (8-31) 

This assumes that there has been no "real" change of 0; that is to say, if we 
had kept to rectangular coordinates, ^ would have had the same value at 

t There is a distinction between "a transformation of coordinates from (a;, y) to (r, 0)" 
and "a transformation from rectangular coordinates (x, y) to polar coordinates (r, 0)". 
The first implies that all coordinates are to be treated alike as in general relativity; the 
second implies that they are to receive the distinctive treatment usually accorded to rect- 
angular and polar coordinates. (Of. Mathematical Theory of Relativity, 16.) 



122 Wave-tensor Calculus [8-3 

(r, 12 ) as at (r, 0). We call displacement without real change parallel dis- 
placement. Differentiating (8-31), the change due to parallel displacement is 
3^7^12 = ~"i^i2^'- For a general displacement we obtain the covariant 
derivative by subtracting from the apparent change d*fi'/dO l2 the change 
arising by parallel displacement. Thus the covariant derivative operator is 

>. (8-321) 

(r\ \ / 7\ \ 

- - - ) = - 1 1 - + p? y | (8-322) 

vUpv/c \vOpv / 

is called the angular momentum conjugate to the coordinate 6^ v . The term 
\iEpy is called the spin momentum. It was originally discovered as a 
correction to the angular momentum, which had previously been assumed 
to be id/ddpy . We see that the spin momentum is merely the difference between 
the covariant derivative and the ordinary derivative. It is a nominal addition 
to the angular momentum due to our non-relativistic outlook. 

By (3-38) the expectation value of E^ is ij^/jw, so that the expectation 
value of the spin momentum is ^j^/j^. The term "spin momentum" is 
primarily limited to the components j M , j 31 , j 12 ; but analytically all 
components are on the same footing, and we have a complete space 
vector //2j 16 giving the part of the momentum (expectation value) which 
arises from the difference between covariant and ordinary derivatives. 
From this aspect the stream vector J is also a momentum vector. The 
identification in 6-6 of its E^ component with mechanical spin is thus 
confirmed and elucidated. 

For an initial contravariant wave vector #*, the corresponding operators 
are 



(8 ' 332) 

Hence the covariant derivative of a space vector J = ^^* is 



, + * (E^ J - JEy,). (8-34) 
The angular momenta generally referred to in quantum mechanics are 
conjugate not to the angles 6^, but to angular parameters - ^ v introduced 
in the following way. Consider a distribution of ^ constituting an elementary 
state of a system. If the boundary conditions, extraneous electromagnetic 
fields, etc. are symmetrical in the plane of 0^ , we obtain another elementary 
state by rotating the whole distribution of through an angle a F in that 
plane. Thus we obtain a series of distributions iff(x l9 x 29 x 9 ,Xt, V) *i> *i 



8-3] The Differential Wave Equation 123 

x a , # 4 being coordinates, and a^,, a parameter distinguishing one distribution 
from another. In place of x ly x z , x 3 , # 4 we can use polar coordinates, in- 
cluding the angle B^ v . We have 

since the first term in the exponential gives the effect on $ of rotating the 
axes backward through a^,,, which is equivalent to rotating the whole dis- 
tribution forward through oc^, and the second term is introduced because 
the point 0^ in the new distribution corresponds to the point 6^ v - oc^ in 
the old distribution. 

Differentiating (8-35) with respect to a^, we have 

(8-361) 



and the angular momentum conjugate to oc^ is 



For x* the corresponding operator is 

(8-363) 



We shall call a^,, a dynamical coordinate, and distinguish M^ and M IW as 
geometrical and dynamical angular momenta, respectively. The importance 
of M v is that (in the symmetrical conditions already postulated) it com- 
mutes with the hamiltonian; this, as we shall see later, makes it a constant 
of the motion of the system, which is then said to possess an integral of 
angular momentum. But when the conditions are not symmetrical, and no 
integral of angular momentum exists, there is no reason to suppose that 
(8-362) represents angular momentum of any kind. It is therefore rather 
misleading to say that -*(3/3fl^-i^ f f4 J is the angular momentum of a 
system; it is a form to which the dynamical momentum reduces in parti- 
cular cases when it happens to be constant. On the other hand, M^ has 
the same interpretation for symmetrical and for unsymmetrical systems. 
Whilst it has apparently little connection with momentum as conceived 
in classical mechanics, it is a natural generalisation of the quantum theory 
definition. 

To reach the dynamical outlook we must promote <x 12 , originally intro- 
duced as a parameter, to be a coordinate; so that ^ is a function of five 
coordinates <x 12 , x l9 x 2 , x z , # 4 , or in polar coordinates a 12 , 12 , <, r, t. (The 
configurations occupy four dimensions 12 , <, r, t, as before.) This means 
that a series of elementary four-dimensional states is run together to form 
a single five-dimensional state; and we adopt a new dissection into states 



124 Wave-tensor Calculus [8-3 

in which the five-dimensional state is regarded as elementary .f In combining 
the four-dimensional states we attribute to them a uniform probability 
distribution in a 12 . We can now assign only one probability factor p to the 
whole five-dimensional state, whereas formerly we could assign different 
probability factors p a to each orientation a of the four-dimensional state. 
This sacrifice corresponds to the fact that, the conditions being perfectly 
symmetrical, it is impossible to distinguish the orientation observationally, 
and therefore we never have occasion to consider a modification of the 
initially assumed uniform probability distribution in orientation ("a priori 
probability") through additional information furnished by observation. 

The usual method of obtaining (8-362) is to consider a form of hamiltonian, 
which by its symmetry ensures that the dynamical angular momentum is 
constant, and then to identify an analytical expression which turns out to 
be constant as the angular momentum. This gives no indication of the ex- 
pression for a non-constant angular momentum. For example, in an asym- 
metric electromagnetic field, systems, besides being deformed by the field, 
will tend to orient themselves in a certain way. Even if we ignore the 
deformation and assume that exactly similar four-dimensional states can 
exist in different orientations a, it will be necessary to insert in (8-35) a 
probability factor p^ representing the unequal probability of distribution 
of the different orientations; so that there will be an additional term in 
30/8a. It is clear therefore that (8-362) is not the correct expression for the 
angular momentum in unsymmetrical conditions. 

In setting divj = S Sj^/dx^ ( 8-2), we assumed that the x^ are rectangular 
coordinates, the volume element being taken to be dV = dx 1 dx 2 dx s ... so 
that the probability or probability flux belonging to an element isj^dV/dx^ . 
If angular coordinates are used, two courses are open. The simplest is to 
introduce the vector density j^j^V -g; the equation of conservation is 
then (dj^/9^) = 0. Otherwise we must substitute covariant derivatives 
in place of ordinary derivatives in V. 

It is instructive to check the agreement of the two methods. If E f is the 
matrix corresponding to the radial direction, and E e the matrix for a rota- 
tion in a plane containing the radius (therefore anticommuting with E r ), the 
matrix for the corresponding transverse direction is Eg E r . The corresponding 
term in V is 



-** + - (8 ' 37 > 

t The running together of the configurations along a singular line in 7-9 to form a single 
"state of spin" of an elementary particle may be regarded as an elementary example of this 
procedure. 



8-4] The Differential Wave Equation 125 

Thus for each rectangular coordinate replaced by an angular variable, 
E r /2r is added to V. If there are n angular variables, the result is to change 

the radial term E r dldr to 

""" (8-38) 



If we set 0=r * n ^ and take ^ as a new wave function, the extra term 
\n\r is eliminated. We find that x* behaves in the same way; so that, setting 
X* = r~* w co*, the extra term is eliminated in the equation for o>*. (In 
treating #* we must notice that Eg E r is replaced by E r Eg , and this cancels the 
change of sign of the spin correction.) Accordingly we have a new com- 
bination 3 = <!><*>* = r n J which satisfies the simple divergence equation with- 
out the added terms. This will agree with our former result if 3 is the vector 
density JV^gi hence V g should be equal to r n . This is correct, because 
each substitution dx^rdO^ contributes a factor r to V g. 

8-4. General Dynamical Equations. 

Since our symbolic calculus has been extended by the introduction of 
differential operators djdx^ , which I will call Z)-symbols, it is necessary to 
refer again to the formal definitions in 2-1. 

A symbol which commutes with every symbol in the calculus will be 
called an algebraic number as heretofore. A symbol which commutes with 
all symbols other than D-symbols will be called an algebraic function.^ 

Since D ($x) ^ (D$) x, the i>-symbols do not obey the associative law of 
multiplication. IfD x = d/dx, DJx = 9 (fx)ft x - Hence 



whatever x ma y be. We have therefore in all cases 

Sfldx=DJ-fD c . (8-41) 

The introduction of Z)-symbols leads us to contemplate a wider variety 
of tensor transformations, i.e. a wider variety of systems of description of 
a physical system. We have been using transformations of the form g = e w , 
where is a matrix or more generally an JS-number. It is natural now to 
admit still more general transformations in which may be any combination 
of symbols, including D-symbols. In particular we consider the trans- 
formation q== e iW8 , (8-421) 

where W is any symbolic expression and s is an algebraic parameter. 

Let x*> $ be initial and final eigensymbols of W, the eigenvalue m being 
the same for both, so that 

X*(JF-m) = 0, (TF-w)0 = 0. (8-422) 

t The term ''algebraic function" does not include "algebraic wave functions", defined in 
(8-285) as the product of an algebraic function and a constant wave vector. 



126 Wave-tensor Calculus [8-4 

Transforming these as contravariant and covariant wave vectors respec- 

tively, we have *> _. * e -**r ^ * e -ims 

x x * ' (8-423) 



so that, if J = 0**, . J' = J. 

If T is any other mixed tensor of the same class as J, 

T' = e* w *Te~ iw *. (8-424) 

Hence for an infinitesimal change ds 



so that dTlds = i(WT-TW). (8-43) 

To apply this, we regard s as a coordinate, and consider a physical system 
S described by mixed wave tensors T or the equivalent space vectors. When 
(8-43) is satisfied, displacement along s is parallel displacement of the system; 
that is to say, the tensors describing the system are unchanged except that 
they undergo a common transformation which we interpret as a trans- 
formation of reference frame. 

It is important to understand the significance of this association of ci 
tensor transformation with every displacement. If the system S is the only 
system contemplated, a change of the system of description has no useful 
purpose; but then it is idle to talk of displacing the system, since there is 
nothing to which the displacement can be referred. It is therefore pre- 
supposed that there exists besides S a reference system S' to enable the 
displacement to be recognised; and the displacement constitutes an in- 
trinsic alteration or strain of the combined system 8, 8'. The transformation 
of the system of description, applied to S but not to /S', expresses the fact 
that although S is intrinsically unchanged, its relation to 8' has been 
altered. 

In general relativity we are familiar with this change of the system of 
description which necessarily accompanies every observable displacement. 
It is expressed by the fact that an extended system of rectangular co- 
ordinates is impossible, or equivalently that the curvature of physical space 
cannot vanish. The effect of curvature is that displacements in different 
directions do not commute. In symbolic calculus we are indifferent to 
geometrical pictures, and express the same thing more directly by associating 
non-commuting operators with the displacements. The change of description 
is directly associated with non-commutation of operators, as may be seen 
from (8-43); if the operator W associated with the displacement s commutes 
with all the tensors T of the physical system no change of description 
occurs. 

The displacements whose associated operators are ^/-numbers can be 
represented in Riemannian space. It is, I think, improbable that the 



8-4] The Differential Wave Equation 127 

displacements associated with the more general operators now admitted 
are representable in Riemannian space. If we wish to represent them 
graphically we must adopt whatever form of geometry is necessary to 
provide representation of their non-commutative relations. In practice, 
however, we are not much concerned with the totality of transformations 
of the type (8-421); we have only to pick out a few special forms which yield 
comparatively simple systems.! 

The coordinate s is a dynamical coordinate of the same type as the co- 
ordinate aip V introduced in 8*3. It will be remembered that we there con- 
sidered a probability distribution in space-time and changed its orientation 
through an angle a.^ . Since the distribution was intrinsically unaltered, this 
was a parallel displacement. The space vectors describing the probability 
distribution at all points of the system underwent the same transformation 
(8-35). Since for any wave vectors ^r, #*, the infinitesimal transformation is 



we have W = - i d/fo = i 8/85, (8-44) 

so that W is the momentum conjugate to s. It will be seen that M^ and oc^ 
form a particular case of W and s. 

The distinction between geometrical and dynamical coordinates is rather 
obscured by the fact that in the most familiar system of coordinates, viz. 
rectangular coordinates, the conjugate momenta are the same. The distinc- 
tion is necessary in angular and other curvilinear coordinates because the 
conjugate momenta differ, and indeed are scarcely comparable in concep- 
tion. Further, by generalising our operators, we have introduced dynamical 
coordinates which may not be representable in the same space as the 
geometrical coordinates and may therefore have no counterpart in Rieman- 
nian geometry. The essential difference is that the geometrical coordinates 
express the internal relations of a system or probability distribution, and 
the dynamical coordinates express its relations to external objects. Geo- 
metrical coordinates are internal; dynamical coordinates are external. In 
elementary theory the only changes of external relations contemplated are 
those corresponding to change of position or orientation; but by means of 
general dynamical coordinates we can introduce external variables which 
more closely correspond to the internal strain vectors of the extended 
system which comprises the external reference objects. 

Equation (8-43) is a point of junction of the present theory with 
Dirac's theory. He arrived at it by seeking what he regarded as the 
most natural adaptation of the classical equations of motion to quantum 
conditions. 

t E.g., the forms W, U 19 U s , U a in 9-2. 



128 Wave-tensor Calculus [8-5 

8*5. Extension to Four Dynamical Coordinates. 

Let W, U L , U% 9 U 3 be independent symbolic expressions which mutually 
commute. We may expect to be able to find a common eigensymbol for 
them. It was shown rigorously in 3-7 (e) that a common eigensymbol can 
be found if the commuting symbols are matrices, and the same proof applies 
if they are symbols which satisfy algebraic equations. It is not clear that it 
applies to operators such as d/dx which have an infinite number of eigenvalues . 
But no inconsistency can arise through postulating a common eigensymbol 
for mutually commuting symbols, and the only limitation on the invention 
of symbols is that we must not ascribe to them properties which are not self- 
consistent. It would seem therefore that, if we cannot find a common 
eigensymbol for W, U l9 U 2 , C/ 3 , we are at liberty to invent one just as we 
have invented a square root of 1 in algebra. 

For us it is sufficient that there exist important applications in which 
initial and final eigensymbols $, x* of the four operators can be found. An 
example will be given in 9-2. Let the eigenvalues be w, p, l9 //, 2 , /* 3 . Then 
the transformation (8-423) can be extended to 

^' __ gi^+tfiSi-t-l^+t^s) ^ _ e lXw*4Vi l * 1 +fl2*2+/*3*3> iff. (8-51) 

As before the product e/ = ^x* is unaltered by the transformation. For 
other mixed tensors T we have in place of (8-43) 

dTjds = i(WT-TW), dTfa^i^T-TUJ, etc. (8-52) 

and W, U l9 U 29 Uz are the momenta conjugate to the dynamical coordinates 
s 9 s l9 s 29 s 3 . By (8-52) they are constant over the whole domain of (s 9 s l9 s 29 s 3 ) . 

We shall call the four-dimensional domain of the coordinates s, s l9 s 2 > 5 a 
an $-space. It is a very simple kind of space perhaps simpler than 
Euclidean space but it is unfamiliar since the axes in it are antiperpen- 
dicular. All displacements in it commute, so that it is pictured as flat; but 
in other respects it is not comparable with the space of ordinary conception. 

An elementary example of an S-space is obtained by taking 

W, U 19 C/ 2 , */ 3 = (E IQ9 E^ Ep 9 E y )m, 
where J5? a , Ep 9 E y form an anti-triad. We have then for all wave vectors 

t(8, *, *0, 5 y ) = e< ie*+^^^^+ A Vy)^^ , (8-53) 

which reduces to $(s 9 * a , s f , 5 y ) = e fw ^ ^ ^ v>0 , (8-54) 

when is a common eigensymbol of E (X9 Ep 9 E y . 

Alternatively we can describe the same domain by spectral coordinates 
s a9 s b9 s C9 s d conjugate to momenta J tt , J b9 J C9 J d defined as in (5-71). 

In the most elementary problems we cannot have more than four 
dynamical coordinates, since not more than four independent ^-numbers 
can mutually commute. For that reason we have chosen to consider four 



8-5] The Differential Wave Equation 129 

dynamical coordinates in this section. The same general dynamical theory 
applies to any number of coordinates; but the problems involving four 
coordinates will be classed as one-body problems, and therefore come first 
in our order of treatment. 

Of the four dynamical coordinates one is singled out to be the proper 
time s, and its conjugate momentum W is called the hamiltonian.f The 
reason for this selection must lie outside the system itself; for the dynamical 
equations (8-52) are perfectly symmetrical. The distinctive property of s 
can only appear when we contemplate the system in relation to other 
systems. It is, as we have seen ( 8*1), the argument of the perturbations of 
and by other systems. The peculiarity that the system "goes on" in s, 
whereas it is merely extended in s l , s z , 3 , is explained if it is through s that 
changes in the system are linked to changes in the external world and 
therefore ultimately to the time sequence in consciousness. 

The principle that the separate physical systems into which we dissect 
the universe shall each have just one coordinate in common with the rest, 
is valuable as expressing the conceptions which are the basis of our nomen- 
clature. It is not so important that it should be fulfilled rigorously, since any 
supplementary coupling can be dealt with by perturbation methods. In 
adopting s as the unique link we assume an idealised standard environment 
of the system, such that any change in the environment produces effects 
which occur simultaneously in all parts of the system according to the time 
reckoning s. In special cases we may have to treat an environment which 
deviates markedly from this standard. If light waves are falling on an atom 
in a particular direction, we should take account of the fact that the per- 
turbation travels across the atom with the velocity of light. It would seem 
therefore that the distinction between s, s l9 s^ 9 $ 3 is a matter of degree; and 
that all four coordinates afford potential linkages with external systems of 
appropriate character, though only one is called into play by the standard 
environment which our equations presuppose. 

To sum up: it is idle to treat in our equations a system supposed to have 
no interaction with its environment, since the interaction is the only thing 
about the system which concerns observational physics. On the other hand, 
it is not necessary to go to the other extreme and treat a system with an 
environment of the most general kind that can occur in nature. Just as we 
begin by studying the simplest systems, so we begin by studying the 
simplest form of environment, capable of introducing only the simplest 
type of perturbation. Under these conditions one coordinate 8 plays a 
unique role, and becomes distinguished from s l9 s 2 > *s- 

t In our nomenclature. On all points which concern the relations of s and t our Out- 
look differs so much from the current theory that comparison of nomenclature is scarcely 
possible. 



130 Wave-tensor Calculus [8-6 

8-6. The Differential Wave Equation for a Strain Vector. 

The tetrad of matrices in the wave equations (8-261) or (8-262) may be taken 
to be either E l9 E^ E 3 , J0 4 or J5 16 , jE? 25 , J5 36 , j 46 . The choice is not entirely 
a matter of indifference, because the two tetrads lead to different reality 
conditions. When the p^ are algebraic, the former is the appropriate tetrad 
for a (four-dimensional) vector density 0^*> an d the latter for a space 
vector 0x*, by (6-51). Prom considerations of continuity the same distinc- 
tion must hold for non-algebraic wave functions. 

Accordingly the wave equations without electromagnetic field for a 
space vector 0x* are 



- < 8 - 6i2 > 

Let <* = ix*^45 > so tha ^ 8 = W* is the associated strain vector. Substituting 
in (8-612) we obtain 



<>' (8-62) 
which is equivalent to 

^ (8-631) 



since E^ is the only antisymmetrical matrix in (8-62). Multiplying (8-611) 
initially by E^ we obtain 

4+^4+g| + ** m )*- - < 8 ' 632 > 

Thus the covariant wave vectors 0, ^ are solutions of the same, differential 
equation. 

The equations for $ and ^ are, however, not the same when there is an 
electromagnetic field. In (8-611) and (8-612), m is replaced by 



so that in (8-62) and (8-632), E^m is replaced by 



But in passing from (8-62) to (8-631) the reversal of sign applies only to 
JS? 45 m. The equations for and ^, including electromagnetic terms, are most 
conveniently written 

=0 > (8-641) 



8-6] The Differential Wave Equation 131 

Remembering that i is contained implicitly in # 4 , * 4 and in the imaginary 
matrices JS7 14 , E^ , 1 34 , we see that if the sign of (8-642) is reversed it becomes 
the complex conjugate of (8-641). So that if (a? 1 # 2 #3* is a solution of 
(8-641), its complex conjugate is a solution of (8-642). There are, of course, 
other solutions of (8-642), and it is not necessary to suppose that the two 
wave vectors representing a particle are complex conjugates. 

The wave functions adopted by Diracf and used, I think, in all current 
treatises are $, <f>, not ^r, x- Even when ^ = 0, so that ty and <f> satisfy the 
same equation (8-631), they are taken to be different solutions representing 
waves travelling in opposite directions in four dimensions. This comes 
about because the momentum operators applying to them have been defined 
differently in (8-28); so that when the momentum has a given value (the 
same for *fi and </>) different functions are required. 

We shall call wave functions whose momentum operator is idfiXp + Kp 
wave vectors of index 1, and those whose momentum operator is id/dXp + Kp 
wave vectors of index 1. The former satisfy (8-641) and the latter (8-642). 
The definition will later be extended, so that a wave tensor is said to be of 
index n if its momentum operator is 

v (8>65) 



This applies to covariant, contravariant, initial or final wave tensors, 
d/dXp being changed to S/Sa?^ when the tensor is written initially. 

We have seen that <f> may be the complex conjugate of $. Dirac goes further 
and defines </> as the complex conjugate of 0. In the present theory there is 
no reason to impose this restriction, which is presumably a survival of the 
Hermitic conditions employed in Schrodinger's elementary theory. These 
are superseded by the reality conditions found in Chapter vi. For algebraic 
wave functions (8-285), ^r and ^ can be chosen independently from the 
infinitude of solutions of the elementary wave equation, and there is in 
general a similar independence of non-algebraic wave functions. 

Let us, however, consider for a moment the current theory which takes 
^ to be the complex conjugate of 0. The. full specification of the system is 
then contained in a single wave function 0; for we do not add anything to 
the specification by inventing a special symbol for the complex conjugate. 
The system might equally well be specified by <f>; then is merely a symbol 
for the complex conjugate of </>. But and <f> represent waves travelling in 
opposite directions in four dimensions. It may be asked, Which are the 
real waves, or are there waves in both directions ? The answer is that there 
are no real waves. I suppose that no one nowadays attributes objective 
existence to the waves described in wave mechanics. 

f Quantum Mechanics, 2nd ed., p. 255, equations (9) and (10). 



132 Wave-tensor Caleidus [8-6 

Since then the system is specified by a single wave vector 0, the most 
natural wave tensor of the second rank furnished by it would seem to be 
00* (or, with suffixes, 000), i.e. the outer square of 0. If 



where p^ is algebraic,f we have 

(-iO/a^+^W^W*. (8-66) 

Thus 00* is of index 2. By introducing the complex conjugate an alternative 
wave tensor 00* of index is obtained. Attention seems to have been devoted 
exclusively to the latter. It is of considerable importance; but we must not 
let it unduly divert attention from the primary wave tensor 00*. 

As already stated, we do not accept the limitation in Dirac's theory which 
reduces the specification of a particle to a single wave-vector function. The 
space vectors and strain vectors which comprise the ordinary vectors of 
physics are wave tensors of the second rank. We resolve these into pure 
constituents, which are factorisable into wave vectors. In general there is 
no reason to expect or require that the two factors shall be equal. I have, of 
course, no objection to the employment in quantum physics of wave tensors 
which are perfect squares, if these are appropriate to the problems which are 
studied as is sometimes the case. But to regard it as more than a casual 
adaptation creates an artificial gulf between quantum theory and relativity 
theory, since there is no such limitation in the latter. 

Thus a pure strain vector of index 2 will normally be the product of 
unequal factors /S 2 = 1 2 *. Exchanging one of these for its complex con- 
jugate, we obtain an associated strain vector of index 0, /S = 1 2 *. The 
latter is the strain vector we have been studying, but we shall now turn 
attention to S 2 . S 2 has the advantage that there is no need to factorise it. 
The momentum is given by the operator (8-65), used as in (8-66). Factorisa- 
tion is only needed for wave tensors of index 0. For them the operator (8-65) 
is indeterminate, and it is necessary to find a factor which is not of index 
so as to obtain the momentum. 

At present we treat only wave vectors of index 1. We may note, how- 
ever, that there is a possibility of extending the theory to wave vectors of 
any index n, integral or fractional, p^ being always given by (8-65). 

8*7. Application to Phase Space. 

In Chapter vn we have described a system by a strain vector which specifies 
simultaneously the configuration, the probability of the configuration, and 
the time. This cannot be the strain vector /S = 00*, which was introduced 
for the purpose of specifying the probability only, and is in fact independent 

t Or, more generally, if p^ is an ^-number containing only space-like matrices and there- 
fore commuting with 0. 



8-7] The Differential Wave Equation, 133 

of the time. The strain vector which generates phase space is the associated 
strain vector of index 2, viz. /Sg-^^j*. 

Consider a free particle in field-free space, so that the ordinary wave 
functions are = e iww ^ > *-*-*"*), (8-711) 

where ma =p 1 x 1 +p*x* +Pz^p Q t. 

Their product is the strain vector /S . Alternatively we specify the particle 
by wave functions <Ai> ^2> both of index 1, 

0i=e iw *(Wo> fc-d'-foJo. ( 8 ' 712 ) 

Their product is the strain vector 

S 2 = e 2 *m*OS a ) . ( 8 . 72 ) 

For a displacement <fc, the only change produced in & 2 is a change of algebraic 
phase dO l9 = 2mds. If displacement in time (proper time) is expressed in the 
same linear measure in phase space as in ordinary space, so that d0 16 = dsjR, 
we have 2rads = d$/-R, so that m=H2R (8-73) 

where R is the radius of the phase space. 

At first sight it is anomalous that the general displacement (dx l , dx 2 , d# 3 , 
dx) should be interpreted as change of time only, and not change of con- 
figuration. But the plane wave solutions (8-712) presuppose flat space-time. 
If they are used in curved physical space-time, they must be restricted to 
regions not too large to be treated as flat. This means that the region of 
phase space which they cover is not too large for the distinction between the 
different configurations to be neglected. The apparent discrepancy is thus 
due to the nature of the approximation assumed in plane wave solutions; 
although they formally distinguish configurations by coordinates, they 
suppose that the distinctions when expressed by matrices are so inconsider- 
able that they can be neglected. 

We have suggested ( 5-4) that wave vectors are introduced mainly to 
secure purity of the wave tensors, and that many if not all of the problems 
of quantum theory could be solved by using the wave tensors directly. 
Current theory gives a rather fictitious importance to the vector factors, 
because it recognises only wave tensors of index 0. These do not contain the 
factor e im *i and, since the energy m is one of the most important character* 
istics of a system, it is necessary to examine the factors in order to find it. 
This is avoided by the use of wave tensors of index 2, which contain the 
factor e* im *. There is then no need to have recourse to the wave vectors, at 
any rate so far as the calculation of m is concerned. 

The position may be summarised as follows. Certain properties of a system 
are naturally described by constant symbols, e.g. a steady distribution of 
probability or probability flux. When these are factorised, they are resolved 



134 Wave-tensor Calculus [8-7 

into components 0, </> whose time factors cancel one another. But, since the 
time factors cannot be supposed to exist solely for the purpose of cancelling 
one another, this is a tacit admission that the constant symbols do not 
comprise the whole data of the system, and that in a complete description 
time factors must appear. Chapter vii gives this more compendious descrip- 
tion in terms of strain vectors variable with the time; they are factorised 
into components whose time factors reinforce one another. 

The general dynamical theory of a system described by strain vectors of 
index 2 is analogous to that developed for space vectors in 8*4, 8*5. For 
a transformation q = e iw * 9 we have 

S' = ^'<*' = e iwd * . ^e iWda = e iwd8 Se iWd8 , (8-74) 

where W<f> = <t>*W. (8-75) 

The determination of W from W presents no difficulty, remembering that 
(djdx)<fr = <l>* (8/8x). From (8-74) we obtain the general dynamical equation 
for strain vectors 



(8-76) 

The transformation introduces a strain common to all the strain vectors of 
the system. It may be regarded as defining parallel strain in the same way 
that the transformation of the space vectors in 8-4 defines parallel dis- 
placement. The dynamical coordinate s measures a progressive parallel 
strain of the system. In elementary examples parallel strain is merely the 
internal aspect of what is externally regarded as parallel displacement of 
part of a system. 

8- 8 . The Electromagnetic Potentials . 

There exists an important transformation which leaves invariant the 
momentum vector p^ = i djdx^ + K^ . Let 

0' = e^iAe0 = A0, (8-81) 

where 16 is an algebraic function of the coordinates x^ . Then if 



we havef 
Hence 






Accordingly the transformation 0->0', K^-**/, defined by (8-81) and 
(8-82), leaves p^ unaltered. The electromagnetic field of force is also un- 
altered, because the addition of an arbitrary gradient to the potentials has 
no effect on the force. 

t It is understood that p^ here denotes the value (number or matrix) of the component 
of the momentum vector, not the operational expression -td/&*? M +K M . Thus p^ commutes 
with A. 



8-8] The Differential Wave Equation, 135 

The corresponding transformation of other wave tensors depends on the 
index n of the momentum operator ( i/n) 3/3a? M + K^ . The general law is 

f = AV, (8-83) 

where n is the index of ^ ( 8-6). Since the transformation is algebraic this 
applies to wave tensors of any rank (/S'=A n ), the index of the tensor 
being the sum of the indices of its factors. In particular a wave tensor of 
index is invariant. 

The foregoing transformation will be called a gauge transformation, 
because it is the adaptation to wave mechanics of Weyl's gauge trans- 
formation in relativity theory. We may regard the electromagnetic potential 
Kp as having been created by a non-integrable gauge transformation of 
neutral space-time. If in (8-82) we determine 16 so that SO^dx^ = 2^ , we 
have Kp = 0; that is to say, the gauge transformation removes the electro- 
magnetic field which may therefore be created by the inverse transforma- 
tion. But the equations dO^dx^ = 2^ , determining the transformation, are 
non-integrable unless curl AC = 0. 

To justify the name "gauge transformation" we proceed as follows. If 
16 is imaginary, A is real, and the strain vector $ 2 which generates phase 
space is multiplied by a real algebraic factor A 2 . By 7-7 this represents a 
change of the probability of a range of configurations at the point considered, f 
But the probability is also given by ^*^dF, where ^r, ^* are the ordinary 
wave functions of indices 1 and - 1. J Usually changes of probability are 
expressed by changes of the modifying factor <[>*fa but in this transformation 
^*^r is invariant, since it is of index 0. The transformation therefore changes 
the measure of volume to dF' = A 2 dF. That is what is meant by a gauge 
transformation a change of measure of volume (implying a change of the 
standard of length]_without alteration of the coordinates. In terms of 
coordinates dV=*V gjk, dV'^V^-g' .dr, so that the transformation can 
also be expressed as V g' = A 2 V^- g. 

In Weyl's theory it was taken for granted that changes of electromagnetic 
potential correspond to real changes of gauge. Wave mechanics introduces 
an important amendment. By (8-82) real changes of the electromagnetic 
potential (fc lf * 2 , * 3 , ic ) correspond to real values of 18 , and hence to 
imaginary changes of gauge. The need for this amendment became obvious 
as soon as it was discovered in quantum theory that the significant com- 
bination is - i 3/3^ H- Kp , not 3/Sa^ 4- ^ . 

This amendment removes the only difficulties noticed in the unified 

t Note that this interpretation only holds if A is real. If A is complex (0 W real) the change 
affects the time coordinate and has no effect on the probability. 

J For simplicity we suppose that the transformation redistributes the probability 
without altering the total amount, so that it is not necessary to re-normalise after the 
transformation. 



136 Wave-tensor Cakulus [8-8 

gravitational-electromagnetic theory .| In the attempts to find a geometrical 
invariant representing the Action (loc. cit., pp. 230-3, 257), the difficulty has 
been that in the elementary invariants the total action and the electro- 
magnetic action F^ Fw occur in the combination G + F^ v Ft* v . In particular, 
the generalised volume V(- I *^wl)> which has since been brought into 
prominence by its use in the Born-Infield theory, reduces to this combina- 
tion (loc. cit., p. 233). But there seems to be no sense in adding an electro- 
magnetic action to a total action which already includes it. It is the differ- 
ence G-FpyFP", representing material or non-Maxwellian action, which 
should be represented by the elementary invariants. Devices for changing 
the sign, e.g. by alternating the suffixes in the invariant *G r /4r * (?"'*, were 
proposed, but were not very convincing. But the sign is rectified now that 
we realise that, owing to the identification of the electromagnetic potential 
with real instead of imaginary changes of gauge, IK^ has been substituted 
for Kp and iF^ for F^, throughout the field theory as originally given. 
Consequently F^F^ should have been -F^F^. 

With this amendment the field theory as set forth in Chapter vn of The 
Mathematical Theory of Melativity is acceptable today. The investigations 
in this book have a close connection with it at many points, and confirm it 
by elucidating the manner in which it forms the macroscopic counterpart of 
wave mechanics. 

After introducing gauge systems transformable at will, Weyl pointed out 
that there exists at every point of space-time a natural gauge furnished by 
the radius of spherical curvature; and he later reached the conclusion that 
our actual measures are made in terms of this gauge. This was extended 
by the writer who showed that a natural gauge, not only at every point but 
for measurement in every direction at that point, is provided by the con- 
tracted curvature tensor, and that the law of gravitation is the expression 
of the fact that it is to this gauge that our actual macroscopic measurements 
refer. 

By starting with no determinate gauge system, and thereby discovering 
the natural gauge instead of merely postulating it, Weyl had made a funda- 
mental advance. But, in a sense, his conclusion stultified his premises. The 
principles of physical measurement are bound up with the natural gauge; 
we cannot employ alternative gauges without giving to the words "length ", 
"volume", etc. meanings which they do not bear in physics. Gauge trans- 
formation had become one of those etymological transformations which 
too frequently mar theoretical discussions proclaiming the obvious truth 
that if you alter the meanings of words you may assert anything you like. 
In particular, one of the most attractive features of his electromagnetic 
theory had to be given up, viz. that the arbitrary gradient, which can be 
t Mathematical Theory of Relativity, Chapter vn. 



8-9] The Differential Wave Equation 137 

added to the electromagnetic potential without altering anything observ- 
able, represented the arbitrariness of gauge. Thus Weyl's two results (1) the 
discovery of variable gauge, which accounted for the existence of quantities 
which might be identified with electromagnetic potentials, and (2) his 
discovery of natural gauge, which leads ultimately to the explanation of the 
law of gravitation, seemed to be contradictory; and it was necessary to 
suppose that (2) superseded (1). But we can now accept them both, with the 
modification that the variability referred to in (1) is an "imaginary gauge 
transformation"; that is to say, it is not a change of the real part of log A 
which furnishes the standard for the measurement of lengths and distances, 
but of the imaginary part of log A, which (although called a gauge trans- 
formation by analogy) does not affect the reckoning of length. 

In our present development natural gauge is used from the beginning, 
because displacement first arises as an angular quantity (angle of a trans- 
formation) which is the ratio of the linear displacement to the radius of 
curvature. Thus we do not encounter the preliminary ambiguity which 
leaves the measure of the displacement indeterminate until the radius of 
curvature is brought in as standard.f There is no provision in our theory for 
real change of gauge for using any other standard. The reality conditions 
for rotations restrict the transformation (8-81) to imaginary gauge trans- 
formations, i.e. real changes of the phase angle 16 . 

8- 9 . Non-integrable Transformations . 

In a general way we can trace the origin of the non-integrable gauge trans- 
formation which creates an electromagnetic field. A non-integrable trans- 
formation arises when we contemplate a field of transformation composed 
of transformations which do not commute. For example, the transformation 
^'==e^aid*rM*2jfcjj^ is non-integrable. If we apply it to a square circuit 
composed of successive displacements (dx lt 0), (0, dx 2 ) 9 (dx l9 0), (0, dx 2 ) 9 
we obtain as far as the second order 

0' = (1 - E 2 * 2 dx 2 - frfdvf) (1 - 



which reduces to $' = (1 - 2E l E^c^c^dx^dx^) $. 

Thus the result of taking round the circuit is to transform it to 

^Be-t^s****!**!^. (8-91) 

This, however, does not immediately solve the problem of the creation of an 
electromagnetic field, which depends on a similarly non-integrable algebraic 
transformation. 

t We are able to start in this way because we treat a very simple uniform space-time, 
whereas the field theory is concerned with the origin of the natural gauge system in irregular 
macroscopic space-time. 



138 Wave-tensor Calculus [8-9 

The field K^ is due to systems extraneous to the particle or system S to 
which the wave vector belongs. As explained at the end of 2*9 the 
external particles have their own symbolic frames F^ etc., which commute 
with the ^-symbols of the system 8. The effect of recognising these external 
particles will be to introduce into our calculus a large number of additional 
symbols which commute with the E^ but not in general with one another; 
so that a much wider variety of transformations q can be contemplated. 

We have hitherto ignored the external particles and the transformations 
representing relative displacement of them, because the elementary equa- 
tions suppose S to be in a standard environment, namely neutral space- 
time. But an electromagnetic field presupposes a non-uniform environment. 
A change of position dx^ is not merely a transformation from one point to an 
equivalent point of space-time (4-4); it involves also an intrinsically 
different environment of 8. Thus the displacement dx^ will involve a 
supplementary transformation, representing the change of environment, 
which we may take to be of the form 

0' = exp{F 1 a 1 ^ 1 + 7 2 a 2 da: 2 + r3a3^3 + 7 4 a 4 da: 4 }.0, (8-92) 

where the Y^ are composed of symbols belonging to the extraneous systems, 
and therefore commuting with the E^. We do not suppose that the Y^ are 
anticommuting symbols of a complete set; they will usually be complicated 
symbolic expressions. But provided that they imperfectly commute (as 
normally happens with complicated symbolic expressions), the trans- 
formation (8-92) will be non-integrable; and 0, after being taken round a 
circuit, will not return to its original value but will undergo a F rotation of 
some kind. But since the F-symbol of the rotation commutes with all the 
Ep , it will be indistinguishable from an algebraic transformation; and it will 
count as an algebraic transformation so far as the Enframe is concerned. 

Thus the effect of irregularity of distribution of the surrounding protons 
and electrons, which might be particularised with almost an infinitude of 
detail by introducing their own symbolic frames, is reduced to a non- 
integrable algebraic transformation of the vectors of the -E-frame. This 
transformation represents the difference between the standard environment 
of neutral space-time and the modified environment a difference which 
is recognised as the macroscopic electromagnetic field due to the specialised 
distribution of the external charges. As we have seen in 8-8, a field of non- 
integrable algebraic transformation is equivalent to the insertion of electro- 
magnetic potentials K^ in the momentum vector. 

We have supposed that the supplementary transformation (8-92) contains 
only the symbols belonging to the external systems. Would it not be more 
natural to suppose that it contains combinations of these external symbols 
with the symbols of S, e.g. E^l In that case the non-integrable trans- 



8-9] The Differential Wave Equation 139 

formation will not be algebraic in the .E-frame, and the field cannot be 
represented by a potential vector with algebraic components K^ . I agree 
that it would be more natural. But the question for us is, not what actually 
happens, but what is supposed to happen in the ideal problems to which 
Dirac's equation (8-262) is applied. Actually a charged particle polarises 
the surrounding distribution of electric charges. When it is displaced the 
potential due to surrounding charges is altered by their changed polarisation. 
This effect (Debye-Hiickel effect) is of great practical importance. But it is 
not included in Dirac's equation, which postulates that the field is due to a 
rigid distribution of charge. If Dirac's equation is applied to an electron in 
a field in which the Debye-Hiickel effect is large, it gives an incorrect value 
of the energy. 

Thus in treating the origin of the K^, i.e. of the electromagnetic terms in 
Dirac's equation or equivalently the electromagnetic terms in the momen- 
tum and energy, we must adhere to the same idealised conditions. The 
postulate is that the system 8 itself has no share, direct or indirect, in 
determining K^. We must therefore omit the terms, if any, which are not 
invariant for rotations of the particle, substitution of particles of opposite 
signf or opposite spin, etc. The terms admitted therefore correspond to a 
purely algebraic transformation. There is no need for us to show that the 
omitted terms are small in practical problems; very often they are not. 

The term K^ in the wave equation (8-262) is essentially a microscopic 
electromagnetic potential. In microscopic problems the field due to one or 
more individual particles requires a more complex specification by means of 
multiple matrices. The criterion is that, if the distribution of particles 
producing the field can be treated as rigid, (8-262) suffices. The foregoing 
discussion makes it clear that the field due to the particle itself is not to be 
included in K^. Neglect of this condition has led to the occurrence of an 
infinite self-energy of the particle in certain theories. 

The internal wave equation for the hydrogen atom, adopted in (9-221), 
provider* an exception to the rule that K^ is a macroscopic potential. The 
equation is of the form (8-262) notwithstanding that the electromagnetic 
field is due to a single particle (the proton). This is because the problem is 
transformed by the use of relative coordinates into the motion of a particle 
in a rigid field. It must be emphasised that this is a quite exceptional use 
of *, made possible by the simplicity of the problem, and that the micro- 
scopic interactions of particles cannot usually be represented by a field of 
this form. 

t There is an apparent change of sign of the electromagnetic terms in the wave equation 
when a proton is substituted for an electron; but what has really happened is that the 
electromagnetic terms are unaltered, and all the other terms have changed sign. 



CHAPTER IX 
THE HYDROGEN ATOM 

9*1. Steady States. 

Before tackling a practical problem, it is appropriate to recapitulate and 
systematise certain ideas which have appeared in a scattered way in previous 
chapters. 

In the practical application of wave mechanics the central problem is the 
search for systems which shall be dynamically steady. The phrase ' c dynamic- 
ally steady" requires amplification. 

There is an almost inevitable ambiguity in the use of the words ' ' electron ' ' 
and "proton " in the new physics. We say that (a) an electron is no longer a 
particle but a wave, and (6) that the waves specify the probability dis- 
tribution of an electron. Thus the term is applied both to the distribution and 
to that which is distributed. For definiteness let us call that which occupies 
any point of the distribution an electron-point. We consider then electron- 
points distributed over a domain of geometrical coordinates x^ . A displace- 
ment dXp is a displacement of the electron-point that is contemplated; no 
dynamical conception is attached to the displacement; it is a transfer of our 
attention from one electron-point to another. But we can contemplate also 
a bodily displacement of the whole distribution; such bodily displacement 
is described as a change ds^ of a dynamical coordinate s^ . Here again the 
displacement may be regarded primarily as a transfer of attention from 
one electron-distribution to another, instead of from one electron-point to 
another. But when, by habit, we introduce dynamical conceptions they are 
attached to the displacements ds^, not to dx^. The dynamical electron 
the moving entity is the probability distribution, and its mode of dis- 
placement is wave propagation. 

We need not confine attention exclusively to bodily translation or rotation 
of the distribution. We can consider more general sequences of distributions. 
The general method of specifying a sequence of distributions is by a trans- 
formation q = e iw *\ then a displacement ds signifies the change of distribution 
which is produced by applying the transformation e iwds to the vectors 
describing the distribution. If W is an J5-number, this is a relativity rotation 
of the space vectors defining the distribution, and is therefore a displacement 
without intrinsic change. The corresponding coordinate 8 will be called a 
simple dynamical coordinate. By allowing W to include differential oper- 
ators, we obtain a more general type of displacement including deformation, 
and define a correspondingly generalised dynamical coordinate. Each simple 
dynamical coordinate is closely related to (and frequently confused with) 



9-1] The Hydrogen Atom 141 

a geometrical coordinate x^ or 0^, viz. that defining the direction of the 
bodily displacement or rotation of the distribution; but the generalised 
dynamical coordinates have no geometrical counterparts. In practice, 
however, they often have approximate counterparts; for a generalised 
coordinate usually appears as a slight modification or adaptation of a simple 
coordinate. For example, a free electron possesses (simultaneously) four 
simple dynamical coordinates representing bodily displacement of its 
distribution in four antiperpendicular directions x l9 # 23 , # 45 , a? 16 ; when the 
electron is in the electromagnetic field of a nucleus, we have to find four 
generalised dynamical coordinates to replace these. 

Let us consider a system with four dynamical coordinates s^. In what 
circumstances should we describe the fourfold sequence of distributions as 
steady 1 It would certainly be considered steady if the distributions were 
all intrinsically similar. But that is unnecessarily stringent, since we cannot 
make exhaustive observations of every detail of the distribution. The 
minimum condition is that some recognisable characteristic of the distribu- 
tion shall be steady, i.e. constant over the domain of dynamical coordinates 
8p . Since the observable characteristics (physical vectors) are space vectors, 
we require that a complete space vector J determined by the distribution 
shall be constant over the domain s^ . That is to say, J must be invariant 
for the transformations q = e iw * 8 *. 

We take J to be factorisable. It would be possible to obtain a steady state 
by compounding two pure states neither of which is steady. But the com- 
bination is not of practical importance unless there is security that the two 
states remain superposed with the same relative probability factors when 
external perturbations are admitted. The argument runs: the steady states 
which we wish to discover are those which behave as units under external 
perturbations. Unitary character, i.e. purity, is expressed symbolically by 
a spectral operator. Therefore to bring our symbolism into line with the 
physical conditions to which it is applied, we must represent the unit states 
by spectral operators. The latter are idempotent symbols. We consider in 
particular the idempotent space vector as the simplest element in the 
symbolism that is thrust upon us. We regard the simple elements of the 
symbolism and their physical counterparts, not as hypothetically "exist- 
ing ", but as idealisations which owe their importance to the fact that any- 
thing more complicated can be, and commonly will be, analysed into these 
simple elements. 

By 8-5 a sufficient condition that J shall be constant over the domain 
s, ! , 8 2 , s 3 is that its factors ^r, x* shall be common eigensymbols of W, U^ , 
U 2 , U$. It is easily seen that this condition is also necessary. 

In practice we assume that W, U l9 U 2 , U 3 commute. It is not true that 
operators which have a common eigensymbol necessarily commute ( 3-7 (/)) 



142 Wave-tensor Calculus [9-1 

but if W, #i, U 2 , J7 3 do not commute they will not be constant over the 
domain of s, s l9 s 29 s^ 9 their derivatives being given by (8-52). Transforma- 
tions in which the operational forms are functions of the dynamical co- 
ordinates, e.g. VP(*i,*i,*a), are not considered in wave mechanics at 
present. They bear the same kind of relation to constant transformations 
that general relativity transformations bear to those of special relativity 
theory. The domain of such transformations will have a curvature embodying 
the non-commutability of the rotations in it. Whether it would be profitable 
to pursue the study of non-commuting symbols with a common eigensymbol, 
I cannot say. But it may be worth noticing that existing methods of search 
for steady systems, i.e. distributions with a recognisable characteristic 
which is constant over a multi-dimensional domain, are not necessarily 
exhaustive; and it is just possible that steady states, which may have some 
physical importance, have escaped our analysis. 

Limiting ourselves accordingly to constant transformations, the problem 
of finding dynamically steady states resolves itself into the finding of four 
commuting symbols, or whatever number of symbols may be appropriate to 
the kind of system investigated. The symbols represent constant cha- 
racteristics of the system; but they are generally portions of the constant 
space vector J, and do not imply any constancy of the system additional 
to that originally postulated. For example, when the operators are E- 
symbols, W + U^U^U^E^ + E^E^ + E^^J. 

We must next try to understand why these ' ' steady states ' ' are important . 
Only one of the four coordinates is conceived as time displacement. Our 
dynamical picture of a system pursuing a trajectory in the domain of 
s 9 SJL, s a , 5 3 does not suggest any reason why J should be required to be 
constant in directions transverse to the trajectory. The importance of the 
latter condition is that it introduces the maximum degeneracy into statis- 
tical enumerations. In accepting J as the criterion of steadiness, we 
implicitly decide to count all configurations which have the same J as one 
configuration. Thus the probability occupying the whole of S-space counts 
as the probability of one configuration. Our picture of a configuration 
as an isolated point in /S-space pursuing a trajectory does not apply; it is 
a whole continuum or wave front that "travels". Moreover, when (as in 
quantised systems) the four-dimensional /S-space exists only for discrete 
values of J 9 and intermediate values of J occupy loci of three or fewer 
dimensions, the discrete values have infinitely greater probability than the 
intermediate values. 

An /S-space formed by generalised dynamical coordinates is not on quite 
the same footing as an /S-space formed by simple dynamical coordinates. 
The difference is that the generalised displacement is a transformation 
peculiar to the system, and is not applicable to its idealised environment. 



9-2] The Hydrogen Atom 143 

Now a system without an environment is unthinkable; and it is no use 
displacing it all over the S -space if it cannot take with it the environment 
which its structure demands. That was the early mistake of relativity 
theory, which applied transformations to the differential equations but 
omitted to provide for their application to the boundary conditions. The 
standard environment is uniform, i.e. spherical, neutral space-time. This is 
conceived as permanent; so that the only transformations admitted are 
those which transform it into itself, namely the kinematical ^/-rotations. 
Hence, in general, if we apply generalised W transformations to a system, it 
will no longer fit the boundary conditions where it merges into standard 
space-time. Physically we should say that the new configuration requires 
a pressure or an electromagnetic field to maintain it. 

It might seem from this that the importance of generalised steady states 
is fictitious. But we have to remember that the standard environment is a 
simplification of the actual environment. The practical physicist is not 
concerned with a hydrogen atom existing alone in uniform spherical space. 
He deals with hydrogen atoms surrounded by other atoms or ions, or in 
fields of radiation. We admit that it would be useless to consider generalised 
displacements of a system which could not be applied to its environment; 
but this is discounted by the fact that we cannot say beforehand what the 
exact environment will be, and what type of displacement it will admit. 
We therefore investigate the most general steady states, since they may 
become realisable if the environment is appropriate. 

To conform to our observational knowledge there should be just one 
steady state of a hydrogen atom in the standard environment, namely the 
ground state. The other states can be realised if the environment is such as 
is capable of "exciting" the atom in particular if it contains a field of 
radiation. 

We proceed to an investigation of the steady states of a hydrogen atom. 
This belongs to quantum theory proper, and strictly speaking is outside our 
territory. But it is necessary for liaison purposes to follow through in our own 
notation, and from our own point of view, one practical problem which 
introduces the leading ideas of quantum theory. Unexpectedly the in- 
vestigation has proved to be vitally important, because it reveals an in- 
consistency of a factor 2 (for which, I think, the quantum physicists must be 
held responsible) which led to an error in the numerical results found in 
earlier versions of the present theory. 

9-2. The Commuting Operators for a Hydrogen Atom. 

The conditions for a steady internal state of a hydrogen atom were obtained 
approximately by Schrodinger as a differential equation of the second order. 
This is now superseded by Dirac's exact treatment. It is commonly said that 



144 Wave-tensor Calculus [9-2 

Dirac replaced Schrodinger's second order equation by a first order equation 
(W m)<Jj = 0. I regard this as a misconception. The modern equivalent of 
Schrodinger's equation is the set of partial differential equations 3 W/ds^ = 0, 
30i/8* M = O f etc., which secure that certain characters remain steady for 
fourfold displacement. Since W, U l9 U 2 , U$ contain differential operators, 
these equations are of the second order. The introduction of ^-matrices has 
not affected the fundamental conclusion that the condition that ifi shall 
represent a steady state is expressed by differential equations of the second 
order. 

By the dynamical equations (8-52) the equation 3W r /3 1 = is equivalent 
to 4 W WUi = 0. In this way the original set of equations is reduced to 

the condition 

W, U 19 U 2 , C7 3 mutually commute. (9-21) 

We may regard (9-21) as a first integral of the second order equations. 

The way in which Dirac's wave equation enters into the problem is that it 
fixes the analytical form of one of the symbols W in (9-21); and hence it is 
the starting point for determining three other analytical forms which com- 
mute with it and with one another. 

We have shown in 9-1 that $ must be a common eigensymbol of W, U l9 
U 2 , J7 3 ; so that, if ra is the eigenvalue of W, we have (TF-ra)^ = 0. This 
equation is provided by (8-261), which we found as a condition for the con- 
servation of probability . We could not at that time show that m was constant 
in time; but this now follows from the dynamical equations, because W is 
constant. We have still to show that m is the same for different states of the 
system. It will be proved in Chapter xn that (for particles represented by 
simple wave tensors) m has one of two absolutely determined values, corre- 
sponding respectively to electrons and protons. Using this result in anti- 
cipation, we shall take m to be a constant of nature. 

The eigenvalue m of W is imposed from outside; there are no limitations 
on the eigenvalues of U l9 U 2 , Z7 3 other than those which will be found in the 
course of the investigation. This difference is due to the fact that s is singled 
out as the connecting link with extraneous systems ( 8-5). 

We now consider the wave equation (8-262) for an electron in an electro- 
static field of positive potential proportional to 1/r, so that 



^-(0,0,0, -t/r), f^- 
Then writing (8-262) in the form (JF-w)^ = 0, we have 



We seek three other operators, linear in 9/3^, which commute with W and 



9-3] The Hydrogen Atom 145 

with one another. It is easily verified that the following satisfy this condition : 



~ 1 ' (9>222) 
< 9>223) 

^=a=-- (9>224) 

We denote the eigenvalues of U , Z7 2 , U z , JF by /A, u, 9 m, and our problem is 
to find the relations which must exist between them, in order that the com- 
mon eigensymbol ^ may satisfy appropriate boundary conditions. More par- 
ticularly we wish to determine c, which is the energy conjugate to , in terms 
of w, /A, u. The method of solution here followed is mainly due to Temple.f 
We have enunciated the problem in an abstract way, without mentioning 
the hydrogen atom. But (9*221) is the hamiltonian of the hydrogen atom 
adopted in Dirac's theory, and we shall provisionally assume that that is 
the physical application. There is no obvious reason to expect that the wave 
equation (8-262), in which the K^ are potentials of a macroscopic electric 
field ( 8*9), will be adequate in the interior of an atom; so that for the time 
being the term E^OL/T in (9-221) is only justified empirically. But in Chapter 
xv we shall find that (9-221) is the exact hamiltonian for a system of two 
elementary particles and we shall determine the theoretical value of the 
constant a. 

9*3. Solution of the Equations. 

Let E r ^(E l x l + E^x^E, A x^lr (9-311) 

so that E r 2 = 1, and E r anticommutes with E. By direct multiplication 






r 3 a^ ~ Xl te 

tj^&i-l. (9-312) 



Hence by (9-221) 

(9-313) 



Multiplying this by the common eigensymbol ^, the operators W, U i9 
reduce to their eigenvalues m> p, *, so that 



" ai (9 ' 321) 

t Proc. Roy. Soc. A, 127, 349 (1930). 



146 Wave-tensor Calculus [9-3 

Or, writing ^ = ^01, 

= Q. (9-322) 



T ^4- "El Jf Jjl _ _|_ fl * If /Wl /2 A ~Ifl mm U' B' Jk* /AOQ1\ 

XJ\Jv jF "- "" p ^A ^ "i ** r t"j ^y sag p py < jLv * * fwm "f^A QC l7 OJA) 

so that the equation is da> / v Q\ 

-j- 1 jj I" i 1^ ss y, i y*oo^& ) 

c/y \ r / 

By (9-331) -F 2 =/ a , (? a =8' a , FG+GF= -2*e, (9-333) 

where /=(m a - a )*, gr^-a 8 )*. (9-334) 

To solve (9-332) we make the algebraic substitution 

r=x/2f, a>=e-* x $, (9-341) 
and take and x as our new variables. The equation becomes 



In an ordinary algebraic equation this transformation would have re- 
moved the last term; but in the symbolic equation it leaves a singular 
coefficient J (F/f 1) instead of a zero coefficient. Assume a solution in series 



(9-351) 
o 

where the C n may be non-algebraic. In order that the integral of the three- 
dimensional density ^(f>*r 2 dr may be finite for a region enclosing the 
origin, 0r and </>*r must be finite or else tend to infinity less rapidly than r~*. 
Hence co is finite or diverges less rapidly than r~* f so that 

p>-\. (9-352) 

Further, in order that the same integral may converge when the region 
extends to infinity, the series must terminate at some finite value of n, say 
n = n'. (It can be proved that, unless the series terminates, it diverges 
when #->oo.) 

Substituting (9-351) in (9-342) and equating coefficients of x n +-\ we 
obtain the recurrence relation 

(*+p + Q}0 % --l(PU-l)Ort. (9-361) 

Setting n = 0, (p + G) <7 = 0. So that 6 f is an eigensymbol of G, and p is an 
eigenvalue of G. Hence, by (9-333), p = g. We shall find later that there is 
no possible value of g between and , so that by (9-352) p =g. 

Setting w=w' + l, we obtain %(F/f l)C n , = 0; so that C n . is an eigen- 
symbol of F, the eigenvalue being/. 

Multiply (9-361) by (F/f+ 1); since jP^/ 2 , we have 

(F/f+l)(n+g + G)C n = 0. (9-362) 

Hence, by (9-333), 

0. (9-363) 



9-4] The Hydrogen Atom 147 

Now set n = n'. Since FC n '=fC n *, we obtain 

rc' + gr-<X//=0. (9-371) 

Or, by (9-334), c 



a 



This is Sommerfeld's formula for the energy . Here n r is a positive integer 
or zero. It remains to determine the possible values of p. 

9-4. The Eigenvalues of U^ U 2 . 

Take spherical polar coordinates r, 0, <f>, so that </> is the aziniuthal angle in 
the plane # 2 a? 3 . Then U 2 = i 3/3^4- \iE^\ and, since u is its eigenvalue, 

(Ut-u)$ = (-idfi<l> + \iEw-u)$ = Q. (9-411) 

Let x = e^' tt +*^3>^. (9-412) 

Then fJ 

= by (9-411). 

Accordingly x & constant for change of ^; and since (assumed to be 
single-valued) is unaltered when (f> is increased by 2?r, we have 



Or, since e* ^ = 1 = e* w , 

^r = e-^ + *> 2w ^. (9-413) 

Therefore w + is a positive or negative integer. The eigenvalues of Z7 2 are> 
therefore the half odd integers positive and negative. 
To find the possible values of JLI, we write 

a>i = xJltot-xJlfai+ JjB MJ etc. (9-421) 

Then it is easily verified that 

<>i<>2 ctf a a> 1 = co 8 , etc. (9-422) 

Hence 

<o 1 (co 2 + to 3 ) = (a> 2 + ia>3)(a> 1 --i), (9-431) 

a>! (a> 2 - ia> 3 ) = (a> 2 - ia> 3 ) (^ + i), (9-432) 

(a> 2 + ia> 3 ) (a> 2 i<o 3 ) = w 2 2 + a> 3 2 icoi , (9-433) 

(a> 2 - ia> 3 ) (a> 2 + ia> 3 ) == a> 2 2 + co 3 2 + ia> L . (9-434) 

By (9-222) and (9-223) 

tJ? 4 {7 1 ^ i8 ai 1 + lf8 1 cii t + JB u cii, + i 9 02 = ^!. (9-441) 

Since J? 4 commutes with U l9 we obtain by squaring the first expression, 



using (9-422). Hence 

(9-442) 



10-2 



148 Wave-tensor Calculus [9-4 

and, by (9-434), 



(9-443) 

Our solution ^ is an eigensymbol of U 19 U 2 , but not of E . We can, however, 
obtain a common eigensymbol xo of the three commuting symbols U l9 U 29 E^ 9 
with eigenvalues /*, M, - i, by the method of 3-7 (e) 9 namely #0 = (E^ - i) ^, 
since this gives (E^ + i) Xo = - Hence by (9-442) and (9-443) 

(9-451) 



We introduce a series of symbols defined by the recurrence relation 

Xr = K + *w 8 ) fr-i ' X-r * ( Wa ~~ ia) *) x - f +* ' 
r being positive. Multiplying (9-452) initially by (w 2 + iw 3 ), it becomes 

(co 2 4.fco 3 )(co a -ia> 3 ) Xl = {~(,,--l) 2 + (i + ^} Xl . (9-471) 
Also 

ico! xi = io*! (o> 2 + io> 3 ) xo = i (o> 2 + iw 3 ) (c^! - i) Xo 

by (9-431). Since U 2 =ia) l9 this gives 

^2Xi = K + ^3)(^+l)Xo = (^+l)Xi. (9-472) 

Adding (9-471) and (9-472) and using (9-433) 



2 } Xx- (9-473) 

By (9-472) the substitution of xi for XQ changes the eigenvalue of U 2 from 
u to u l = u+ 1; and, comparing (9-473) with (9-451), we see that the eigen- 
value of (w 2 2 + ci> 3 2 ) retains the same form with u changed to % . 
Proceeding step by step, we find that for x r > the eigenvalues of U 2 and 



Since U 2 =ia) l9 the eigenvalues of w^ are negative. Similarly the eigen- 
values of w 2 2 and o> 3 2 are all negative. If this implies that the eigenvalues of 
co 2 2 + o> 3 2 are all negative,! there is (for given /x) an upper limit to the value 
of u r 2 in (9-474). Hence the series of symbols XT must terminate with symbols 
Xfc> X-fc' sothat 

Xk+i = (^2 + ^3) Xk = 0, X-*'-i = (o> 2 - ico 3 ) x-tf = 0. (9-481) 
Adapting (9-452) to Xk> ^ e condition gives 



We find similarly 0=-(fi-l) 2 +(-i + u^) 2 9 so that 

( / i-l) 2 = (^ + fc + J) 2 = (^~&'-i) 2 . (9-482) 

Hence, for a fixed value of /*, the possible values of u range between 
| ^_ 1 1 - an d _ | JX- 1 1 + , the former limit corresponding to & = and 
the latter to *' = 0. 

t The legitimacy of this inference is examined later. 



9-5] The Hydrogen Atom 149 

For our purpose the pertinent result is that by (9-482), /A is a positive or 
negative integer; for we have already seen that u + $ is a positive or negative 
integer. We are therefore able to calculate the energies e of all possible 
steady states by giving n* and ft integral values in (9-732). Zero value of ^ is 
excluded because it would make the term VO* 2 - <* 2 ) imaginary; zero value 
of n' is not excluded. 

The assumption made above that o) 2 2 4- o> 3 2 has only negative eigenvalues 
requires consideration. I do not think there is a general law for non-com- 
muting symbols that if the eigenvalues of X and Y are all negative the 
eigenvalues of X + Y are all negative. But here we contemplate a restricted 
type of eigensymbol i/t. In one form or another conditions must be introduced 
which make the expectation value with respect to iff intermediate between 
the greatest and least eigenvalues. As this part of the theory of eigen- 
functions in wave mechanics does not concern us very closely, we shall not 
enter upon it. Accepting such conditions, the expectation values of c^ 2 and 
o> 2 2 will always be negative, and therefore the expectation value of w^ -f o> 2 2 
will always be negative. The eigenvalues, being particular cases of expecta- 
tion values, will therefore be negative. 

9- 5 . Metastable States . 

According to the theory which will be developed in Chapter xv, our adopted 
wave equation for the hydrogen atom is exact] that is to say, the interaction 
of the proton and electron is precisely expressed by a potential * 4 = icK.fr. 
There is no failure of the formula however small r may be. We have found 
exact solutions of the exact equations; and they agree with the well-known 
series of quantum states of the hydrogen atom. But we seem to have proved 
too much ! Observationally these states are only imperfectly steady; even 
if the atom is undisturbed, they do not endure indefinitely. Thus a state 
which satisfies the exact theoretical conditions for a steady state is found 
observationally to be imperfectly steady. 

The explanation is that in practical applications we have to take into 
consideration the environment as well as the atom itself (p. 143). We have 
found the exact solutions of the differential equations; but whether a par- 
ticular solution has an exact counterpart in nature depends on whether the 
boundary conditions which it demands are forthcoming. The boundary 
conditions of a "state " are difficult to visualise. Our general outlook is that 
whatever exists outside | the state is to be treated as a possible source of 
perturbations; an environment is therefore regarded as conformable to the 
boundary conditions of the state if it causes no perturbation of the state. 

Light is thrown on this subject by distinguishing between the algebraic 
and the non-algebraic wave functions of a hydrogen atom. The algebraic 
t In the sense of not belonging to the state, not necessarily exterior to it in space. 



150 Wave-tensor Calculus [9-5 

solutions of (9*332) are easily found. If CD is an algebraic wave function, the 
term d/dr in 



reduces to an algebraic function of r, so that o> must be an eigensymbol of 
F + O/r for all values of r . Therefore co is an eigensymbol of F and G. Then 
by 3-7 (/), FQ- OF is singular (or zero). By (9-331) 

$(FG-GF) = iE r n-E r Etmp + iEim<x,, (9-51) 

so that J ( FG - GF) 2 = c 2 /* 2 - m V + w 2 a 2 , (9-52) 

since E r , JS? 4 , J5? rJ E 4 anticommute. The right-hand side must vanish, because 
FG GF can have no reciprocal. We have therefore 

2 /(w 2 - 2 ) = O* 2 - oc 2 )/a 2 . (9-53) 

Comparing with (9-372), we see that the algebraic wave functions correspond 
to n' = 0. In this case the series (9-361) reduces to its first term, and the wave 
function is 



(9 . 54) 

It is well known that the states given by n' = are the metastable states. 
Accordingly the distinction between the metastable and the unstable states 
is that the former have algebraic wave functions and the latter non-algebraic 
wave functions. 

It appears therefore that, in order to satisfy the boundary conditions 
furnished by the standard environment uniform neutral space-time the 
wave function must be algebraic. Non-algebraic wave functions do not 
precisely satisfy the boundary conditions; but the discrepancy can be 
treated as a perturbation. The spontaneous transitions from these states 
to lower states are attributable to the perturbations which represent this 
discrepancy. Even the algebraic wave functions are not quite perfectly 
conformable to the boundary conditions, the only really permanent state 
being the ground state. 

9*6. Single -valued ness of the Internal Wave Function. 

We must now refer to the assumption italicised in the second paragraph of 
9-4. In determining the possible eigenvalues of U 2 and U l9 $is assumed to 
be a single-valued function of rectangular coordinates. But the assumption 
is untrue. Dirac's Lorentz invariant wave vector $ is necessarily a double- 
valued function of rectangular coordinates ( 4-7). Little attention seems to 
have been paid to this inconsistency; but it reveals a flaw in the foundations 
of the current theory of the hydrogen atom. 
We have to consider two alternatives: 

(1) The investigation in 9-4 must be amended so as to apply to double- 
valued instead of single-valued wave functions. 

(2) The eigensymbol is a single-valued function, and the investigation 
in 9-4 is correct ; but in this case cannot be a wave vector, and ( W - m) $ = 



9-6] The Hydrogen Atom 151 

cannot be Dirac's original Lorentz-invariant equation. Its origin therefore 
remains to be investigated. 

Our theory will be found to lead to the second alternative; but we may 
briefly consider the first. The result of taking $ to be double-valued is that, 
when the angle <f> is increased by 27r, may become either + or - $. We 
can then deduce that the eigenvalues of u are the integers and half -integers. 
No further change is made until we reach (9-482), which shows that the 
eigenvalues of p will also be the integers and half integers. We obtain 
therefore twice as many eigenvalues of p and of u as in the previous dis- 
cussion. The additional eigenstates do not correspond to observed states of 
the atom. 

Double-valuedness of the eigenfunction has been discussed by Temple.f 
It is pointed out that there would be a distinction between the integral and 
the half-integral eigenvalues, the latter being in a sense ineffective because 
a displacement has no matrix components corresponding to them. It is 
doubtful whether this ought to be regarded as an excuse for, or an objection 
to, a theory which employs the double-valued $. But the crux of the matter 
is the exclusion principle. The observational result is that when we consider 
a nucleus with a number of electrons, the electrons occupy the states given 
by integral values of /x and u + , and ignore the "ineffective" half-integral 
values. It would therefore be necessary to abandon the accepted form of the 
exclusion principle, and substitute a new principle according to which only 
a quarter of the cells, into which phase space is divided by the eigenfunctions, 
are allowed to be occupied. This would be a very drastic alteration of existing 
quantum theory. 

Following the second alternative, the only change required is to recognise 
that the eigenfunction of an internal state of the hydrogen atom (dis- 
tinguished as ^ H ) returns to its original value when the space axes are rotated 
through 27r, and therefore rotates twice as fast as the wave vector ^ (dis- 
tinguished as ifi D ) in Dirac's Lorentz-invariant wave equation. Thus fl 
transforms like a space vector or strain vector. We have seen that the in- 
ternal configurations of a system are specified by strain vectors, and we 
therefore take ifi H to be a strain vector. It must be of index 2, since a strain 
vector of index (representing a combination of waves travelling in opposite 
directions) would not have the unidirectional properties of H . 

Current theory has assumed that the momentum in practical units is 



t An Introduction to Quantum Theory, pp. 106, 131. His discussion (which is based on a 
treatment by Born and Jordan) refers to the Schrodinger scalar wave function. It is pointed 
out that (even in that case) ^ may not be a single-valued wave function, and that the usual 
inference as to the integral values of the angular quantum numbers is "decidedly precarious". 



152 Wave-tensor Calculus [9-6 

It is on this basis that the observational value of h is determined, and we 
must accept it as a definition of h. The momentum operator for a tensor of 
index 2 being iA/27r, the standard momentum operator for a wave vector 
of index 1 will be ih/Tr by (8-65). We have therefore 



The common assumption, that the momentum operator which applies to 
the wave function of a hydrogen atom applies also to the relativistic wave 
functions introduced in Dirac's Lorentz-invariant equation, is thus found 
to be untenable. The error of a factor 2 has escaped notice, because the 
Dirac wave functions are highly abstract (being referred to an unobservable 
geometrical frame) and are not directly concerned in comparisons of theory 
and observation; it is in connection with internal wave functions, which 
(as we have seen) are not Lorentz invariant ( 7- 1), that quantising conditions 
arise. But it is important that the factor should be set right in fundamental 
investigations of the connection between relativity theory and quantum 
theory. We have therefore to note that the theoretical unit of mass used in 
our fundamental investigations is such that A = TT, not 27r, and that the 
momentum operator for wave-vector functions is (in practical units) 

* * (9 ' 63) 

It is in accordance with the theory of phase space in Chapter vii that the 
configurations or states of the hydrogen atom should be discriminated by 
strain vectors of index 2 rather than by wave vectors. The theory of the 
energy levels of hydrogen thus falls into line with the general theory of the 
representation of internal configurations of systems. Looking at the matter 
from a physical point of view, the double-valuedness of \fi D reflects the abstract 
character of the analysis which introduces it. We can only observe relations 
between two systems or parts of a system, so that the minimum we can 
contemplate observationally is a double function *f> D .$ D '. Rotation of the 
axes through 2ir no longer introduces an ambiguity of sign; each factor 
becomes multiplied by e, and the product by e 27ri . When we consider 
relative motion, i.e. refer each particle to the other as origin, ^, ^/ 
become equal; ty D gives the distribution of the electron relative to the proton, 
and 02/ the distribution of the proton relative to the electron. In this case 
^H ^D-^D-t If we prefer to represent the state by a wave vector if/ D9 we 
must remember that ^= V^r H , and not ignore the ambiguity of sign 

| In this way wave tensors of index 2 which are perfect squares acquire a special 
prominence in practical problems. The wave tensors of index which correspond to them 
are Hermitic ( 8-6). This seems to be the origin of the Hermitic conditions, which have 
been applied too indiscriminately in current quantum theory. 



9-6] The Hydrogen Atom 153 

which is an essential characteristic of \fi' D . Taking the square root is, however, 
a gratuitous complication; and the actual investigation of the hydrogen 
atom is based on H . 

As a working rule, we note that normally the coefficient of the momentum 
operator will be ih/ir for absolute coordinates of a particle referred to a 
geometrical origin, and i A/2?r for its relative coordinates referred to another 
particle as origin. But it is impossible to lay down a universal rule, since the 
momentum operator depends on the index of the wave function adopted 
(8-65). 

Accordingly in practical units the special form for the hydrogen atom of 
the wave equation (8-262) of the double-valued wave function will be 

E *\- l *E ^-fiiUj-0, (9-64) 
dxj cr * J ^ D ^ ' 

since in practical units ^ = (0, 0, 0, fe 2 /cr). When, as in (9-221), the 
operand ^^ H = ^f D ^ D9 the coefficient A/TT must be replaced by h/2rr to give 
the same condition. It follows that the constant a in (9-221) is 

l/a = Ac/27re 2 . (9-65) 

Attention may be called to a further point on which we are unable to 
accept the current view. In (9-224), following the usual notation, we have 
denoted the coordinate conjugate to U B by t. We must now point out that t 
has nothing to do with time. Our equations refer to the internal states of the 
atom, and we have seen (7-1) that the only time- variable in an internal 
state is the dynamical or proper time s which is conjugate to W. If the atom 
is moving in an external frame the coordinate time in that frame is obviously 
not the t referred to in the equations. The dynamical coordinate conjugate to 
U 3 is of an altogether different nature, and will be identified in Chapter xv 
with the linearised permutation coordinate of the proton and electron. 



CHAPTER X 
DOUBLE WAVE VECTORS 

10* 1 . Multiplication of Probabilities. 

Probabilities are combined by multiplication. We have seen ( 7-7) that the 
configuration of a system and the probability of that configuration are speci- 
fied by a single symbol a strain vector. The multiplication of probabilities 
therefore involves a multiplication of the strain vectors; and the treatment 
of a combined system is based on the multiplication of the vectors descrip- 
tive of its separate parts. 

If the combined probability of two events is precisely the product of their 
separate probabilities, the two events are said to be independent. Similarly, 
we define independent systems to be such that the combined vector speci- 
fying probability and configuration is the product of the separate vectors. 
If in combining the systems the product is modified, the modification con- 
stitutes an interaction. For the present we consider combination without 
interaction. 

It is, of course, entirely opposed to our habit of thought to regard a system 
as the product of its parts rather than as the sum of its parts. Therefore we 
have a long way to travel before we can connect the combination of systems 
by multiplication with our ordinary outlook. 

Admitting that multiplication is the primary operation in the theory, the 
prevalence of exponentials in the formulae which we have developed in- 
dicates the way in which the subsidiary operation of addition arises. We see 
that in general the additive quantities of physics must occur in exponentials. 
An elementary example is afforded by action, which is well known to be 
additive; in wave mechanics it is represented as the phase angle of an 
imaginary exponential. Other cases are not so simple. We speak of the 
density p in a unit volume, due to the probability distribution of a particle, 
without specifying whether it is a low probability of a high mass or a high 
probability of a low mass; this agrees with the principle that the probability 
p and the characteristic m of the system are not to be detached from one 
another. If we consider a second particle, the combination of probabilities 
pp' involves taking the product pmp'm' =pp'. But the combined system is 
ordinarily considered to be characterised by a density p+p'* This is a point 
to be investigated in due course.! 

In so far as an addition a + j3 arises from a multiplication e" . eP> addition 
must be looked upon as a possibly non-commutative operation. We have to 

f The methods by which wave mechanics is adapted to treat additive properties instead 
of multiplicative properties are explained in Chapter xvr. 



1O2] Double Wave Vectors 155 

treat " ordered sums " of matrices as we treat ordered products. An integral 
may also require to be ordered. I do not propose to use the algebra of non- 
commutative addition, which is even less familiar than the non-commutative 
multiplication which we have employed; I mention it in order to show the 
type of mathematical complication in which we should be involved if we 
attempted to follow the common outlook more closely. 

Our first task is to study the double wave tensors which are formed when 
we multiply the simple wave tensors specifying the probabilities and con- 
figurations of two independent systems. 

10*2. Double Frames . 

By multiplying two wave vectors a , <f>p belonging to two independent 
systems we obtain a 16-valued quantity *F aj8 which we call a double wave 
vector. Formally it resembles a wave tensor of the second rank (mixed or 
covariant) obtained by multiplying wave vectors ^, Xp belonging to the 
same system; but it has a wider field of transformation. In a wave tensor a 
transformation of x * s locked to the transformation of ^r; but in a double 
wave vector there is no such restriction. If T is a covariant wave tensor and 
*F a double wave vector, their most general transformations are 



the former involving 16 coefficients g aj3 , and the latter 256 coefficients 



For example, let the objects described by the wave vectors 0, ^ be the 
earth and moon. We may use different axes of reference for the momenta, 
spins, etc., of the earth and moon. If we change the axes, there is no need to 
apply the same transformation to those used for the earth and those used 
for the moon. It is true that we do not usually avail ourselves of this liberty, 
and a transformation of axes would ordinarily apply to both bodies; but 
that is because we contemplate an earth-moon system, which is not 
merely a mental association of the earth and moon, but comprises physical 
relations between the two bodies not to be found in either body separately. 

More usually we are interested in the other aspect of a relativity trans- 
formation, in which it is regarded as a displacement without intrinsic change 
of a physical system, the frame being kept fixed. In the case of the earth and 
moon, this implies the introduction of two sets of vectors ^ a , fy capable of 
being rotated independently. 

To provide for independent transformations of ^ and ^ we must introduce 
two symbolic frames. We have therefore two complete orthogonal sets 
Ep, Fp. Inthematrixrepresentationof the symbols we take (forconvenience) 
Ep and F^ to correspond to the same matrix; but in chain multiplication 
EH is connected with the first suffix of a preceding or following double vector, 



156 Wave-tensor Calculus [10-2 

and Fp with the second. If more than one EorF symbol appears in a product, 
the jE's are in chain with the JE 9 s and the F'a with the F's. The order of 
writing E'B and F'B is of no importance; that is to say, every E^ commutes 
with every F^ . As usual E^ = i, -F 16 = i. 

Unless the contrary is stated we shall suppose that E^ and F^ are right- 
handed sets. It may sometimes be more appropriate to use a right-handed 
and a left-handed set; in this case the matrices E p and F^ cannot be identical, 
JF 16 and one of the pentads being reversed in sign as compared with the corre- 
sponding JS?-inatrices. The corresponding modification of the various 
formulae is easily found. 

Since (for right-handed sets) the letters E, F merely indicate the chain 
connection, there is no need to distinguish them when row-and-column 
suffixes are inserted. The E F notation is a device for extending the matrix 
notation, which omits suffixes, to expressions of the fourth rank such as 



The 256 double symbols EpF v (^, v= 1, 2, ... 16) constitute a complete set; 
that is to say, if a linear function of these symbols with algebraic coefficients 
is called an 2?.F-number, the operations of addition, subtraction and multi- 
plication applied to JfjP-numbers always yield ^J^-numbers. In matrix 
representation every E J'-number is a fourfold matrix of the fourth rank, 
and conversely. A matrix of the fourth rank T^ps is resolved into com- 
ponents according to the formula 



(10-221) 

or, without suffixes, T = S t E F v . (10-222) 

/*" 

The definition of pure wave tensors is extended to double wave tensors. 
A pure double tensor J is the product of two double wave vectors T, X*, 
and we have j. TO -E^ Vr . (10-223) 

A convention as to the order of suffixes in T or J is necessary. It is defined 
by (10-221). We write the product of the double vectors x F ajff , X y g* as / ay ,0s . 
The comma separates "first" suffixes (in chain with E^) from "second" 
suffixes in chain with F^ . 

To determine j^ v9 multiply both sides of (10-223) by E a F r and take the 
spur. The spur is formed by identifying the last suffix of each chain with the 
first suffix; thus the two chains contract separately. In particular we have 
spur (EpFJ^spm J^xspur F v . Hence, by (3-32), 



spur E u F v = if a or v ^ 16) 

M . \ (10-231) 

= ~16if/Lt = v=16. J v ' 

Hence spur p^J^F^FJ = spur (j^E^F^) - !#. (10-232) 



10-2] Double Wave Vectors 157 

By (10-223) this can be written 



Thus the formula corresponding to (3*37) is 

j^&X'E.F^. (10-24) 

Let the symbols E^F V in some conventional order be denoted by K a 
(or = 1, 2, ... 266). Then K a 2 = E^F* = 1, and K , K r either commute or anti- 
commute according to a fixed scheme. There are also multiplicative relations 
of the form K a K r = K^ or iK^ . These properties constitute the structure of the 
symbolic frame K^ , and any other set of symbols with the same structure is 
an equivalent frame. Equivalent frames are obtained by the transformation 



where q is any non-singular X-number. For, as shown in 2-7, this trans- 
formation does not change multiplicative relations. 

The theory of double frames K^, double wave tensors jF ay 0$, double 
strain vectors 5 ax ,0g, etc., follows on similar lines to those of simple frames, 
tensors and vectors. Since q now contains 256 components, we have 256 
independent relativity rotations, i.e. transformations without intrinsic 
change. The double space vectors are outer products of two complete space 
vectors, or sums of such products. We may call them complete space tensors 
of the second rank; but besides containing the ordinary components of a 
symmetrical tensor of the second rank, they include components of tensors 
of the third or fourth rank with some degree of antisymmetry (just as a 
complete space vector includes an antisymmetrical tensor of the second 
rank). 

We notice that the new matrices K^ given by (10-25) are not in general 
simple products of JF-matrices and .F-matrices. For example, if q = **!* 9 9 



This is evidently necessary because the K transformation has 256 coeffi- 
cients, whereas separate transformations of E^ and F^ are defined by 32 
coefficients. 

Considering a system described by a double wave tensor T, the trans- 
formation T f = qTq~ l , where q is a A-number, will represent displacement 
without intrinsic change. For, as in 2-9, the new system could have been 
obtained by carrying out the same construction in a different but equiva- 
lent frame K'qKq-*. This extension of the relativity principle is rather 



158 Wave-tensor Calculus [10-2 

difficult to grasp, because at first sight it seems to conflict with our ex- 
perience. If we rotate the moon without rotating the earth, we create an 
observable difference in the angle between their axes, which must surely be 
counted as an intrinsic change of the double system. The earth and moon is 
scarcely a fair illustration; but it is tempting to think that there would be 
an analogous intrinsic change in a system of two elementary particles 
(protons or electrons), since these possess planes of spin. Now it is clear 
that, if there is anything in the double system which is intrinsically changed 
by the transformation, it is unrepresented in the symbolic tensor T.f It 
signifies therefore that the conceptual process of combining systems is 
something other than a pure multiplication; in other words it introduces an 
interaction. We have already had a hint as to the nature of this modification, 
namely that in a combined system we contemplate only "simultaneous" 
configurations of the particles. In this chapter we shall not consider inter- 
action. Our point of view may be expressed by saying that we shall treat 
double systems as a preliminary to introducing the cementing interaction 
which will make them into combined systems. 

It has to be remembered that particles described by simple stream vectors 
have an even probability distribution throughout spherical space. Relations 
between them are therefore not very comparable with the relations between 
localised objects such as the earth and moon. It is one of the features of 
interaction that it renders possible a more localised probability distribution, 
as is illustrated in the theory of the states of a hydrogen atom. 

10*3. The Interchange Operator. 

Since the complete sets E^, F^ commute, they come under Case (6) of 2-7. 
They are therefore connected by a transformation 

, P 2 =l, (10-31) 



16 

and P = JSjE7 M ^. (10-32) 

i 



Since P 2 =l, PE^P^PE^P.PF^^F^. (10-33) 

And quite generally the operation P(...)P interchanges E and F within 
the bracket. 

Let y^-t(Vr + *r'|.). ^ = 4(V,-^). (10-34) 

Since P (E^F V E 9 FJ P = E^ E^F V 

on multiplying by final P, we have 



so that PY^=Y^P> PC |W --{ W P. (10-35) 



t It may be expressed in the matrix representation of T 9 since matrices have the property 
of symmetry or antisymmetry which is not invariant for transformations. We have seen 
( 7*3) that this discrimination corresponds to laying down planes of simultaneity. 



10-3] Double Wave Vectors 159 

We can write, instead of (10-222), 

2 7 =r-f2 7a = S^/y /iV + S/ MV ^ v , (10-361) 

where V^ + 'rfi* V"^ ""*** (10-362) 

Then PT=T*P, PT a =-TP. (10-363) 

That is to say T is divided into symmetrical and antisymmetrical parts 
T 8 , T a , which respectively commute and anticommute with the interchange 
operator P. 

There are 136 independent matrices y^ in the symmetrical part, and 120 
independent matrices ^ in the antisymmetrical part. 

Setting as usual T a j3=T^ a , we can show that 

PT=-f, Y*P=~T*. (10-37) 

By (10-31), PS^PV^F^V. That is to say, instead of applying tho matrix 
operator F^ to the second suffix of T, we can first apply the change repre- 
sented by P, apply the matrix operator to the first suffix, and then undo the 
change P (since P~ 1 = P). Evidently the operation P interchanges the two 
suffixes. It may include in addition a self-reciprocal operation Q which 
commutes with E^, so that QEpQ Ep. But since we may substitute any 
other J57-symbol or jP-symbol for E^ in the foregoing argument, Q must 
commute with all the symbols and therefore be algebraic. Thus Q= 1, 
since these are the only self-reciprocal algebraic operators. 

To determine the sign, consider the special case Y^ = 8 a , where S^ is the 
substitution operator. Then by (10-32) 



Using four-point matrices, the ten symmetrical matrices give 

V^=V=-^ 
and the six antisymmetrical matrices give E^E^ = 1. Hence 

SJ^^= -10 + 6= -4; 
so that P8=-l=:-8. 

Accordingly the sign in (10-37) is verified. 
The operator P can be factorised. Let 



We easily find 

i 

Alan P* 1 P21 PP PP 

xxIBO Xj *, -*2 "~" ' 12""" 2 1* 



160 Wave-tensor Calculus [10-3 

This factorisation is based on conjugate triads ( 3-8), and the factors 
P! , JP 2 are the interchange operators for the two minor complete sets. The 
operator P l is commonly written! 

A= -i(<W + <W + *s*s' + l)s (10-385) 

a form obtained by writing cr^ = t M in (3-87). P l is then treated as the whole 
interchange operator, the factor P 2 being replaced by its eigenvalue 1. 
This is legitimate in the more elementary types of problem for which a single 
set of Pauli matrices suffices. 

We can extend the theory of interchange to double sets. Let K^ L^ 
(JLI= 1, 2, ... 256) be two double frames arranged in corresponding order, so 
that Lp, L v commute or anticommute according as K^, K v commute or 
anticommute. By the method of 2-7, 2-8, we can find an interchange 
operator P KL , such that L^Pj^K^P^ 1 . ^ ^ e ^ are new symbols 
commuting with the JC M , the operator is 



256 

I % Vv < 10 ' 39 ) 



If, on the other hand, the L^ are If-numbers, the factor ^ is replaced by an 
adjustable algebraic coefficient a; also the singular case may arise, requiring 
us to substitute another reflection of K^ . 

10-4. Duality. 

Consider a matrix of the fourth rank which is the product of four wave 

V6Ct0rst ^-^Xr"*. (10-411) 

Since a and y are "first " suffixes, ^ and x are wave vectors in the frame E ; 
^ and o> are wave vectors in the frame F^ . Let 

x F = 0o>*, X = fo*, C7 = ^x*, F = ^o>*. (10-412) 

Then U and V are tensors of the second rank in the frames E and F 
respectively; X F and X, being composed of one vector from each frame, are 
double wave vectors. We have 



(10-413) 

We can associate *F and X with new frames Cj/ and D^' in the same way 
that U and V are associated with the frames E^ and F^ . When the "crossed 
frames" G^ and D^ are adopted, Y and X become wave tensors of the 
second rank and 17 and V become double wave vectors. 

This gives two alternative resolutions of T into matrix components, viz. 
T = S^J^P V =S V / C' /1 / JD; (10-421) 

t Dirac, Quantum Mechanics, 2nd ed., p. 226, equation (35). Dirac uses Pauli matrices 
whose square is + 1. 

| It is not necessary that the factorisation should be actually possible; the theory in this 
section will apply to symbolic factors ( 3-3). 

Unaccented frames C 9 D are introduced later. 



10-4] Doubk Wave Vectors 161 

or, with suffixes, 

2^8 = S fa (E^ Y (E w )p - S V ' (Z^U < jy fr . (10-422) 
The distinctive notation C", D', E, F is not required when suffixes are 
inserted. 

Evidently a double matrix E^F V is a linear function of the 256 symbols 
Cp'ty, and vice versa. We shall determine this function explicitly. Let 



Or, With SUffixeS, (^) ay (JFJ / M = S a-T V W (^rU(^r)jiy- ( 10 ' 432 ) 

Multiply both sides by (^\)8 ( E P )yp The right-hand side is 



The left-hand side is 

(^W 
Hence 

^A P = TVspr(^^^S A ). (10-44) 

As an important example, consider J5 16 F w . Let 



or, with suffixes, -(l) ay (l)^ = S ff . T ^ar(^)8(^r)fr. (10-452) 

Then, by (10-44), '= -&spur(E T E ff ) (10-46) 

o that v' aa = J, v'er = (cj ^ T). Hence 



where P OD ' is the interchange operator of the frames C^' and /}/. Similarly 

OJD^-Pn. (10-472) 

Thus by crossing the frames, algebraic quantities in the frame C'D' 
become multiples of the interchange operator of the frame EF, and vice 
versa. 

We find similarly that E tL F fl = Z a (C a 'D a '), (10-48) 

where the minus sign is to be taken if E a anticommutes with E^ . 

A slight variation of the foregoing process is obtained by taking frames 
Op , Dp such that the connection 

2 7 = S^^ V = S V C^Z) V (10-491) 

stands for T a p - S fa (E^ (E v )& S v (^) aj3 (^) y8 . (10-492) 
This "straight cross" is easier to manipulate analytically, and we shall 
generally use it; though it is perhaps less physically significant than the 
"inverted croBs". If j^=S V>OT <7 Z> T 

we obtain, by the same method as before, 

J r^r). (10-493) 



162 Wave-tensor Calculus [10-4 

where, as usual, (jE T ) aj8 =(JS T )^ a . Hence 



the minus sign applying to the six antisymmetrical (time-like) matrices. 

Ah V 4 5 - IS ( CM ( 10 ' 495 ) 

the minus sign applying to the six electrical matrices a = 01, 02, 03, 04, 05, 16. 
The same formulae hold if EF and CD are interchanged. 

We call the tensor r^ the dual of the tensor t^ v . 

Denoting the double matrices E^ F V9 C^D V by K a , L a (<r= 1, 2, ... 256), the 
equivalent frames K a and L a are connected by a transformation 

L a =qK a q~i. (10-496) 

It should not be difficult to evaluate q as an E jF-number or CD-number; but 
I have not succeeded in doing so. The calculation is complicated because it 
is found to come under the singular case; the value of P KL calculated from 
(10-39) is zero, so that q^P KL - *&*& same applies to the transformation 
connecting E tL F v and C^DJ. 

Without actually evaluating q, we see that the crossing of frames is 
included in the relativity transformations of the double frame. We can 
regard the transformation as a continuous one, giving a sequence of double 
frames intermediate between EF and CD. 

10*5. Significance of the Grossed Frames . 

According to the uncertainty principle there are two extreme ways of 
specifying a particle: (a) the momentum factor may be specified exactly, 
so that the position vector is entirely uncertain, and (b) the position vector 
may be specified exactly, so that the momentum vector is entirely uncertain. 
Wo shall find that the effect of crossing frames is to transform specification 
(a) into specification (b). That is to say, a particle which has the specification 
(a) in the EF frame has the specification (b) in the CD frame, and vice versa. 
Double frames enable us to express symmetrical space tensors of the 
second rank in wave tensor form. (A single frame provides only for the anti- 
symmetrical tensors of the second rank.) Consider, for example, the outer 
square p^p,, of a momentum vector jp M . We call Ppp the energy tensor, f 
or more precisely the self -energy tensor of the particle. In ordinary relativity 
theory the energy tensor of a particle is j^Pv/w; but the factor m belongs to 
a later stage in the adaptation of our equations to practical conditions, and 
can be omitted for the present. By (6-51) the four-dimensional momentum 
vector is represented by EE^p^ and the energy tensor is therefore repre- 
sented by ZXffrtpp, Oi, v = 1, 2, 3, 4). (10-51) 



t Both the momentum vector and the energy tensor include momentum and energy. It 
is an accident of current nomenclature that they are named differently. 



10-5] Double Wave Vectors 163 

If the momentum vector is exact, we may take its direction as our time 
axis. The energy tensor (10-51) then reduces to a single component 

aJS? 45*45- (10-521) 

To represent an entirely uncertain momentum we must distribute the pro- 
bability evenly over momentum vectors in all directions in space-time. The 
resultant momentum will be zero, but the resultant energy tensor will be 
definite. "Entire uncertainty" implies that there is no preference for any 
direction in space-time; the resultant energy tensor must therefore be of a 
form invariant for rotations and Lorentz transformations. The only in- 
variant tensor of the second rank is S^. Hence 

s^^-jsy, 

where j8 is an invariant, and the summation extends over all the vectors 
which compose the probability distribution. Then by (10-51) the resultant 
energy tensor is fi ^ ., 

P ( A i6*i5 + #25*25 + ^35*35 + ^45*45)- (10-522) 

To extend this from simple space vectors to complete space vectors, we 
define the complete self -energy tensor to be the outer square of the complete 
stream vector. If the complete stream vector is exact, it can be reduced 
to the form a* (E l + E 2 ^+ 46 + E u ), and the complete energy tensor is 

a (^ + EM + # 46 + E u ) (1\ + JF M + F^ + JP 16 ). (10-523) 

This refers to an elementary charged particle; for a neutral particle the 
complete tensor is still of the form (10-521). 

If the stream vector is entirely uncertain, the distribution must be such 
that the resultant energy tensor is invariant for all appropriate relativity 
rotations. The natural generalisation of (10-522) in 

10 
02^*;. (10-524) 

But this corresponds to a probability distribution which is symmetrical 
with respect to an electrically saturated space-time. To obtain symmetry 
with respect to neutral space-time, we must substitute neutral sets (6-412). 
Distinguishing kinematical and electrical matrices by suffixes k and e, the 
resultant complete energy tensor is then 

^(S A ^^~S e ^^) = 4j8(7 45 jD 45 (10-525) 

by (10-495). Comparing with (10-521) we see that this corresponds to a 
neutral particlef with definite stream vector in the CD fiame. 

This establishes the important result that for a neutral particle an entirely 
uncertain distribution of momentum in the EF frame corresponds to an 
exact momentum in the CD frame. (The momentum is here understood 
to include energy, spin and magnetic moment.) 

t When the complete uncertainty extends to all components of the stream vector, the 
charge must be completely uncertain. 



164 Wave-tensor Calculus [10-5 

We see also that the straight cross CD is involved because we take a 
neutral frame as standard. The theoretically simpler inverted cross C'D' 
corresponds to a saturated EF frame. 

The position vector can be treated similarly. We can define a complete 
position vector which includes, for example, the angular coordinates con- 
jugate to the spin momenta. For a neutral particle it reduces to the ordinary 
four-dimensional position vector, since there is no spin or magnetic moment 
to be described. The outer square of the position vector will be called the 
position tensor; it plays an analogous part to the energy tensor in the fore- 
going theory. 

The energy tensor and the position tensor have definite values even 
when the corresponding vectors are uncertain; but for brevity we shall 
call those of type (10-521) "exact", and those of type (10-525) "uncertain". 
If a particle is represented by an exact energy tensor and an uncertain 
position tensor in the EF frame, it is represented by an uncertain energy 
tensor and an exact position tensor in the CD frame. If the scale relation 
is appropriately chosen, the exact energy tensor of the EF frame is identical 
with the exact position tensor of the CD frame; and similarly the two un- 
certain tensors are identical. In short the position space of the EF frame 
is the momentum space of the CD frame and vice versa. 

We have here adopted the four-dimensional point of view, in which the 
position vector is measured from an origin in space-time. This was necessary 
in order to compare our formulae with the ordinary form of the uncertainty 
principle; but it slurs over the effect of curvature of space-time.t In five- 
dimensional representation the position vector is measured from an origin 
at the centre of curvature of space-time; and in the standard notation an 
exact position tensor has the form 

a^ 5 ^ 6 = JaS(C' /A Z{ i ), (10-53) 

where the minus sign refers to p = 04, 14, 24, 34, 54, 16, by applying (10-493). 
The connection between the position space and the momentum space is, as 
it were, rotated by the matrix JSJ 4 . The connection is expressed more simply 
by correlating the four-dimensional density of the position vector to the 
three-dimensional density of the momentum vector. Each of these gives a 
second-rank tensor of the form J 

*tfi6*i0= i (S.C^- S^Zp, (10-64) 

t If the position were entirely uncertain in infinite space, the position tensor would be 
infinite. In assuming it to be finite, we have in a sense taken account of curvature of space- 
time; but it has not been necessary to introduce the curvature in any other way. 

% Multiplying the position vector (matrix E 6 ) by iE 6 to obtain its four-dimensional 
density, and multiplying the momentum vector (matrix E t6 ) by iE^ to obtain its three- 
dimensional density, the result is in each case an algebraic quantity (matrix E u ); so that 
the corresponding second rank tensor is &E U F 16 . 



10-6] Double Wave Vectors 165 

where s and t refer to the space-like and time-like matrices. It is these 
densities which become interchanged in the transformation EF-> CD. 

Formally an energy tensor ^ t E tL F v p VL p v refers to two particles or systems, 
since it introduces two frames. It is a special case of a mutual energy tensor 
of two particles ^E^Fyp^p^ formed by multiplication of their stream vectors 
SJS^pp, I,F v p v ', as the combination of probabilities requires. A self -energy 
tensor is obtained by imposing the constraint p^ 'p^' (or alternatively 
PH~ JP//); this constraint greatly limits the number of relativity trans- 
formations applicable. There are two possible interpretations of such a 
constraint; either the second particle is a fictitious duplication of the first, 
introduced for formal purposes, or the centroid of the two particles is 
deemed to be fixed so that their stream vectors are automatically made 
equal and opposite. 

This study of a double frame throws considerable light on the relation of 
position and momentum in the matrix theory. But it is only a step towards 
the actual conditions in which position and momentum are known obser- 
vationally; and further progress will be made in Chapter xi. 

10-6. The 136 -dimensional Phase Space. 

When four-point matrices are used, the frames JE^, F^ each contain ten 
imaginary and six real matrices. Hence the frame E^F V contains 

(10 x 10) + (6 x 6) = 136 real matrices 
and (6 x 10) -f (10 x 6) = 120 imaginary matrices. 

Since (E fl F v ) 2 =l 9 the eigenvalues of E^F V are real. Accordingly, by the 
definition in 7-3, the 136 real matrices are space-like and the 120 imaginary 
matrices are time-like. 

The general infinitesimal transformations for covariant final and contra- 
variant initial double wave vectors are 

T'-e^T, X*' = X*e-*, (10-61) 

where d&^^K^O^. Considering a single term 0^, the two transforma- 
tions in (10-61) are the same if 

exp&(^)r(E v )M.V^^ 
That is, if - ( jy ya (E 9 )y - (E^ y (E v )p 8 

or -KpF^EpF,. (10-62) 

This requires that E^F V should be imaginary or time-like. Thus when E^ F v 
is time-like, a covariant wave vector transforms like a contravariant wave 
vector, and a covariant wave tensor transforms like a mixed wave tensor. 

Proceeding as in 7-2 we obtain double strain vectors (covariant double 
wave tensors), which behave like double space vectors for time-like trans- 
formations and substitute antiperpendicular rotations for relativity 



166 Wave-tensor Calculus [10-6 

rotations in space-like transformations. The latter generate a 136-dimen- 
sional phase space. 

The theory of ten-dimensional phase space applies without important 
modification to 136 dimensions. Stereographic coordinates can be intro- 
duced; and the result corresponding to (7-59) is 

V^7= .R- 136 (1 + r 2 /4JR 2 )- 136 . (10-63) 

Since (EpF v ) 2 =l, the circular rotations are of the form e iE ^ fvU ^, where 
Up V is real. Thus the matrix d 8 for real displacements in phase space is 
imaginary, as in simple phase space (7-31); but the coefficients 0^ (unlike 
the 0^) are imaginary. 

The condition E^ F v = E^F V satisfied by the space-like matrices means that 
they are symmetrical for an interchange of the suffixes a, )8 with y, 8 in 
(10-492). There is another kind of symmetry which corresponds to the inter- 
change of a, y with j8, 8, i.e. interchange of first suffixes with second suffixes. 
The double matrices y^ v introduced in 10-3 have this kind of symmetry, 
and the ^ have the corresponding antisymmetry. It is inconvenient to 
work with y^ v and J^, since they have no simple commutative properties 
and their squares are not algebraic. But the two kinds of symmetry are 
interchanged by crossing the frames as in (10-492); hence the space-like and 
time-like matrices of the CD frame have the same symmetry and anti- 
symmetry as the y^ v and l^ v . It is easily seen that the 136 y^ v are linear 
functions of the 136 space-like matrices of the CD frame and the 120 
are linear functions of the 120 time-like matrices. 

Thus the separation of T into T 8 + T a in the EF frame (10-361) is the 
same as the separation into space-like and time-like parts in the CD frame. 
For any double system we have two alternative phase spaces according 
as we adopt the EF frame or the CD frame. If one of them corresponds to 
classification of configuration by position, the other will correspond to 
classification by momentum. If a simple E or F phase space gives a classi- 
fication by position, the EF phase space will give the position configurations, 
and the CD phase space will give the momentum configurations. For 
example, an algebraic displacement in EF space represents change of time, 
and in the CD space represents change of energy. Since the points of CD 
space represent distributions with fixed momentum, generally called 
elementary states, we may regard the CD space as composed of states and 
the EF space as composed of configurations. 

It is perhaps not self-evident that the EF space gives the same kind of 
classification as the simple E and F spaces. t For a crucial test consider 
an electrical displacement with matrix E l for the space vector or j 23 for the 
strain vector. To give this displacement to both particles we must apply 

t It had occurred to me that the process of amalgamating two simple systems to form a 
combined system might involve crossing their frames. The test here given dispels this idea. 



10-7] Double Wave Vectors 167 

the transformation q**eto*+**t 9 to the double strain vector. Since this is 
a y 2 3,ie rotation it gives displacement in CD space; there is no displacement 
in EF phase space (or in the separate E and F phase spaces) since the 
matrices E 2S9 F Z3 (or H n F M9 S l9 F n ) are time-like. It can scarcely be sup- 
posed that we create a positional displacement of the double system without 
displacing the simple systems positionally; but the displacement in CD 
space can well be interpreted as a change of electrical energy, which has no 
counterpart in a simple system. This indicates that the E F phase space in 
which no change occurs is the positional space, and the CD phase space is 
the momentum space. 

The following property of CD phase space is of interest. If we divide a 
double wave vector into symmetrical and antisymmetrical parts 1 F*, X F, so 
that X F = Y*, Y= - Y, we have, by (10-37), 

H* = i(l-P)Y, Y*=H1 + P)Y. (10-64) 

Consider a transformation V = <f. If q commutes with P, we obtain 

( x F')* = g x F, (')=}. (10-65) 

But for a non-commuting g, say q = 1 + ^,,0, 



(10-60) 

Thus *F* and ^ a are kept distinct in the y^ v transformations but not in 
the Jp,, transformations. The former comprise all the displacements in CD 
phase space. Thus CD phase space has the distinctive property that a strain 
vector of the form Y*O* remains of the same form at all points of phase 
space; similarly for the forms x F"O a , *p<I> a . Thus states determined by 
symmetrical and antisymmetrical wave vectors X F*, *F a form distinct sys- 
tems not transformable into one another by strain. 

This separation of symmetrical and antisymmetrical wave functions plays 
a prominent part in the current treatment of double systems, as developed 
by Fermi and Dirac. Since it applies to CD space but not to EF space, it 
applies to states not to configurations. Since this is in accordance with the 
current interpretation, it confirms our identification of the two phase spaces. 

10*7. Representations of Two Charges . 

The strain vectors representing elementary charged particles have been 
given in (6-64). Using the matrix representation in (3-61), we resolve them 
into their wave vector factors 



8 C =m (1, - 1, -i, i) (1, - 1, i, -i), 

5jm(l, - 1, i, -i) (1, - 1, -*, i). (10-71) 



168 Wave-tensor Calculus [10-7 

First consider two charges of opposite sign and the same spin, whose 
wave tensors may be taken to be 



so that (omitting the masses m, m') we have 

= w = (l, 1, -i 9 -i), Xss j = (i 9 i f t,i). 

The double vectors, obtained by a straight cross, are therefore 

T = ^* = ^, X = X o>* = ^. (10-72) 

If the masses are inserted, T and X can be given any coefficients m", m'" such 
that ra"w'" = mw'. 

As another example take two charges of the same sign and spin, with 



Then Y = ^* = (l, 1, -i, -)(!, 1, -i, -i) 

=1 i -i -; 

1 1 -i -i 



so that the double wave vectors are degenerate (5- 56). 

A combination of two charges of like sign and opposite spin, or of opposite 
sign and spin, likewise gives degenerate double wave vectors. 

We see that there is a simplicity in the combination of charges of opposite 
sign with like spin, which is not exhibited by any other combination. The 
CD and the EF representations are interchangeable, and the double wave 
vectors are non-degenerate. The physical interpretation of this peculiarity 
is that when charges of opposite sign with like spin are superposed, their 
electric and magnetic fields cancel. They thus constitute a self-contained 
unit which can be inserted in a background of neutral space-time without 
disturbing its neutrality. Other combinations cannot be treated as isolated 
units for they disturb by their external fields the surrounding matter. In 
their case we must take into account, in calculating probabilities for the 
purposes of statistical theory, not only the probability of the configuration 
of the two particles, but the probability of the polarised configuration of the 
rest of the universe. 

If then we wish to treat what is strictly a two-body problem, we must 
confine ourselves to particles of opposite sign and like spin. 

To understand the nature of the superposition referred to above, we must 
recall that the momentum vectors of the two charges are given definitely 



10-75] Double Wave Vectors 169 

by S a and S b . Hence, by the uncertainty principle, their positions are 
entirely indeterminate. We are therefore superposing two probability 
distributions which extend uniformly over the universe. It is not a question 
of superposing localised particles; they would be disturbed by interaction 
effects which we have not yet studied; and their exact superposition is, in 
fact, inhibited. The theory of this chapter is limited to the combination of 
independent probabilities. 

It will be seen from (10-71), or directly from (tf-64), that 



By making an inverted cross we obtain double vectors l l'" = 0w*, 
X' = <x*. These are non-degenerate if the two charges are of like sign 
and spin, and are degenerate for all other combinations. We have already 
noticed that the inverted cross bears the same relation to electrically 
saturated space that the straight cross bears to neutral space; and the 
non-degenerate combination of two charges of like sign and spin would 
evidently fit saturated space in the same way that two charges of opposite 
sign and like spin fit neutral space. 

10-75. Importance of Double Tensors. 

The fundamental tensors of relativity theory are symmetrical space tensors 
of the second rank, viz. the metrical tensor g^ v and the energy tensor 
G^- Ig^O. These must appear as double wave tensors in the wave-tensor 
calculus; and it is clear that any serious attempt to unify relativity theory 
with wave mechanics must be based on double wave tensors. In macro- 
scopic relativity theory the interval ds between two points is regarded as 
the fundamental observable; correspondingly in relativistic wave mechanics 
the double wave tensor of two particles is the basis of description. 

In treating a system of n particles, we naturally introduce a corresponding 
n-tuple wave tensor. But the step from a double to an ra-tuple wave tensor 
is not comparable with the step from a single to a double wave tensor. In 
relativity theory we do not recognise any relation between three points 
analogous to the interval between two points; nor are there any space 
tensors of the nth rank analogous to the metrical tensor and the energy 
tensor. In proceeding from simple to double wave tensors we approach the 
meeting point with ordinary relativity theory; in proceeding to treble and 
n-tuple wave tensors we diverge again. Quantum physicists have them- 
selves recognised that the specification of a system of n particles cannot well 
be left in the form of an w-tuple wave tensor, and have introduced another 
method of specifying such systems by a sum (not product) of Jordan- Wigner 
wave functions, which will be used in Chapter xvi. 

The number 136, being the number of dimensions of double phase space, 
will be prominent in our formulae for the principal constants of nature. This 



170 Wave-tensor Calculus [10-75 

does not imply that their application has special reference to systems con- 
sisting of two particles. It implies that in wave mechanics, as in relativity 
theory, the study of a complex system is based on the relations between pairs 
of particles contained in it, and on the tensors of the second rank which 
embody these relations. The number of dimensions of double phase space 
occurs in the formula for the mass of an electron or proton, because mass is 
by definition contained in a space tensor of the second rank equivalent to 
a double wave tensor. 

The quadruple tensor also plays a rather special part in the theory. If a 
primitive entity or relatum is represented by a simple wave tensor, we require 
two relata in order to provide an observable relation. To assign measure to 
this relation we require another observable relation with which to compare 
it. Thus a measure involves four relata two to provide a quantity to be 
measured and two to provide a unit of comparison. It therefore depends on 
a quadruple wave tensor. This coincides with our conclusion in general 
relativity theory :f 

Thus four points is the minimum number for which an assertion of absolute 
structural relation can be made. The ultimate elements of structure are thus 
four-point elements. 

Thus simple, double and quadruple wave tensors appear in the theory; 
but there is no occasion for introducing any other combination for the 
purposes of fundamental study. For systems of more than four particles 
(or two particles plus a "comparison fluid") we abandon the method of 
multiplication of probabilities, and develop a procedure (formally investi- 
gated in Chapter xvi) which is in a sense intermediate between macroscopic 
theory and elementary wave mechanics. 

Generally the quadruple wave tensor remains in the background of the 
theory; we need only occasionally remind ourselves that the double wave 
tensors, which we employ, require such a background. We shall generally 
regard the quantities described by double wave tensors as observables, in the 
same way that the interval ds is usually regarded as an observable although 
any numerical value that is attached to it expresses its relation to another 
interval. 

10-8. Relative Coordinates . 

Let Xp, Xp be the coordinates of two points in four dimensions, associated 
respectively with symbolic frames E^, F^ . We make the transformation 



which gives x^ = A^ + f^ , x^ = X M - ^ . (10-812) 

t Mathematical Theory of Relativity, 98. 



10-8] Double Wave Vectors 171 

Then the x^ are coordinates of the centroid, and 2 M are the coordinates 
of one point relative to the other. The corresponding formulae for the 
derivatives are a a 9 a 3 a 

a^-gi' + aV' K^T'top top" 



fi x nl^_. ' i + It -3 /~nl3,- 3 f (lU-0 w) 

We are going to find two new complete sets G^ , H^ which, we shall show, 
are associated with x^, ^ in the same way that E^, F^ are associated with 
x n> x u> so ^at IL is the frame whose rotations determine the vector 
transformations of the relative coordinates 2^ . 

Let P be the interchange operator ( 10-3) of E^ and J5J, , and let 



for p,= 1, 2, 3, 4. We here write V for an algebraic square root of 1, not 
necessarily the same square root as E 1B and J^ 16 . Solving (10-831) for E^ 9 F^ 
we find p p 

^rS^-*" 7 ^' ^-fr?^-^- (10>832) 

Then 



Similarly 0^0, = (1/2*0 

(10-841) 



Since ^, v are restricted to the values 1, 2, 3, 4, the right-hand side is anti- 
symmetrical in p. and v. Therefore G^ G v =-G v G fA . Similarly H^ H v = - H v H^ . 
Also 



(10-842) 
These two expressions are equal, so that O^H^ I^G^. 

These results show that G l9 2J G 39 C? 4 and 15^ , // 2 9 H.^H^ are tetrads, and 
that the H'B commute with the (?'s. From these tetrads we construct two 
complete sets 0^, jfi^ (fi= 1, 2, ... 16) which commute with one another. By 
(10-842) GHl2i f )(F. + i f E A )^E l F A (10-843) 



for p, = 1, 2, 3, 4. This result can be extended to all values of ^; e.g. 
G^H^G^^H^G^.G^^E^.E.F^E^F^. 

Hence by (10-32) the interchange operator P for E^ and J^ is also the inter- 
change operator for G^ and H^ . 



172 Wave-tensor Calculus [10-8 

The Gp and H^ not belonging to the original tetrad do not obey (10-831). 
In general they have no simple connection with the corresponding E^ and 
F^\ but the result for G b , # 5 is of interest. We have 



\iP (Ei + VPJ P (E 2 + t' 



2 EM\-i'EiFi-EuFu-i'E u Fu-EsFs-...). (10-861) 

The bracket contains 16 terms. We have given one specimen of each type 
of term . All terms of the same type have the same coefficient ; the verification 
of this is an interesting study of the operation of the permutations. We 
obtain similarly from (10-8.5) 



(10-862) 
Multiplying (10-861) by E$ and (10-862) by i'F b and adding, we obtain 



Hence, multiplying initially by P and finally by G 5 , 

#5 = -(J^ + t'JU- -G f ', (10-863) 



where G 5 ' is the symbol defined analogously to O l9 G 2 , G 3 , 6? 4 by putting 
ft = 5 in (10-831). A similar result is obtained for H 5 . 

Thus we can, if we wish, extend the present investigation to relative 
coordinates in five dimensions, defining 6?^, ff /x (/x= 1, 2, 3, 4, 5) by (10-831); 
but in that case if E^, F^ are right-handed sets we must construct G^, H^ 
as left-handed sets. Since we have at present no use for this extension, we 
keep to right-handed sets, and accordingly define 6? 5 as i6r 1 6r 2 (7 3 6r 4 . 

By (10-811) and (10-831) we have 



4 4 

Or writing x = S E^ , x = S G^ x^ , etc., 



(10-871) 



(10-872) 

For example, if we have two non-interacting particles of proper mass m 
which satisfy the wave equation (8-262) without electromagnetic field, viz. 



Similarly, if V a = 2^3/9^, V i= = 2^9/3^, etc., 
i ^ i 

op 
' 



10-81 Doubk Wave Vectors 173 

the product wave function T satisfies both equations, and therefore satisfies 



Hence, multiplying by 2P/(1 + i'), 

0. (10-88) 



A noteworthy point is that the vectors x, x', and also the vectors x, 5, 
have to be combined with a quarter period difference of phase represented 
by i'. There is evidently no other way of obtaining a transformation of 
the kind required. If we try to combine them without phase difference, 
we have 



The trouble then arises that E^ + F^, E^-F^ are singular symbols with 
eigenvalues and 2i. The product of two vectors (E + -P\) x^ , (E - 1\) & 
in the same direction in the absolute and relative spaces is identically zero. 
This means that we have made a singular transformation, so that either the 
external (absolute) configurations or the internal (relative) configurations 
have shrunk to a point in our representation. 

In the transformation here found A-^ and ^ are referred to symbolic 
frames which are equivalent to one another and to the original frames of 
x and x'. Thus the relative coordinates can be treated in the same way as 
the absolute coordinates hitherto treated. It would be more logical to say 
that the absolute coordinates can be treated in the same way as relative 
coordinates, for all our experience is concerned with relative positions. But 
to render the representations similar we have had to measure the co- 
ordinates in various complex units. This is seen by writing (10-871) in the 
form - 



There does not seem to be any way of extending the foregoing transforma- 
tion to particles of unequal mass. On reflection we see that no such extension 
could be expected. The integrated mass of a particle distributed uniformly 
all over the universe is a highly artificial conception, and has no clone 
relation to the concentrated masses of classical particles. Particles with 
different masses (protons and electrons) do not appear in the theory until 
a later stage of development than that which we have now reached, and their 
wave functions are not the primitive relativistic wave functions, introduced 
by Dirac's equation, which we are at present studying.! 

In the next section we treat a more familiar transformation of a somewhat 
different character which concerns particles of unequal mass. 

t See 11-6. 



174 Wave-tensor Calculus [10-9 

10*9. Relative Coordinates Unequal Masses. 

For two particles of proper mass m, m', the coordinates x^ of the centroid 
and the relative coordinates f^ are given by 



Hence m 38 m' 3 ___ d_ no .Q 12 \ 

l ' 



Writing M=*m + m', p, = mm'/(m+m f ), (10-92) 

we obtain from (10-912) the well-known formula 

IJL+JL^I^.+AJL (10 -93) 

Introducing momenta, ^= id/dx^, m = idfi^, etc., this becomes 

In the transformation (10-911), the jacobian 

|i5f^=l (10-95) 

so that the transformation x^ , x^-^x^ , ^ does not alter volume elements. 
Normally (10-911) applies to space-coordinates only, the time being the 
same for the two particles. We can, if occasion arises, extend it formally to 
four coordinates; but it must then be remembered that the relative time 
r = t t' is not the time which is ordinarily associated with relative co- 
ordinates f ^ . 

We notice especially that 

mm' = Mp. = M , say. (10-96) 

We have seen ( 10-1) that a system is to be regarded as the product, rather 
than as the sum, of its parts; and from this point of view the product mass 
M is its leading dynamical characteristic. The two ways of factorising M 
lead to two modes of dividing the system either into two ordinary particles 
m, m', or into an external particle M and an internal particle p. These two 
modes of factorising M are familiar in classical mechanics, especially in 
celestial mechanics, where p is called the reduced mass associated with the 
relative orbit. 

If p 2=s Pi 2 +p2 2 +Pa*> etc., we obtain by summing (10-94) 

p 2 /m +p' 2 /m' = P*IM + t& 2 //*. ( 10-97) 

For simple plane waves the energy was found in (5-251) to be 

/n *?) /^/ //>* 2 _L /yi2\i _o_ <**i _L i/n2//*yi /lO*Qfil\ 

The conditions described by plane progressive waves are, however, highly 
abstract, and (10-981) has no immediate interpretation in terms of obser- 



10-9] Double Wave Vectors 175 

vation.f All observed motion is relative motion, and the energies with 
which we are concerned observationally correspond to relative coordinates 
1^. Clearly energy must be defined in such a way that either mode of 
division of the system gives the same total energy. By (10-97) the function 
of p, which is conserved in the transformation from P, w to p, p', is p 2 /ra; 
the variable part of the energy must be equal or proportional to this. We 
therefore set the energies equal to 



, 

where a is a constant coefficient. These satisfy 



(10-983) 

We notice in (10-982) that the relative energy e does not include any rest- 
energy. 

It has commonly been supposed that the energy m + op 2 /w is merely an 
approximation to (m 2 + jp 2 )*, the value of a being \ . That is not so ; m + ojj*/m 
is the exact expression in the conditions with which we are here concerned, 
and (10-981) is irrelevant. According to the common assumption tho 
variable part of the energy is {(m 2 +p 2 )l m}, and it is taken for granted 
that the relative energy is also of this form, viz. {(p 2 + iD 2 )^ //,}. The latter 
expression is the one which concerns us observationally, since we can only 
observe relative positions and energies. But 

{(m 2 +# 2 )* - m} + {(m /2 +y ^ 

Thus the sum of the internal and external energies of the system is not 
equal to the sum of the energies of the constituent particles ; and it is 
clear that the form used is inadmissible. 

We shall have more to say on this point in Chapter xni. Meanwhile the 
following explanation must suffice. The expression for the energy operator 
in terms of the momentum operators p l9 p 2 , p 3 (the three-dimensional 
hamiltonian) is a leading dynamical characteristic of a system. What we 
here find is that the two parts into which we divide a system have not the 
same dynamical characteristic as the set of plane waves studied in 5-2. 
It was scarcely likely that they would; for our first investigations were 
concerned with abstract conditions, and it is only with the introduction of 
double wave functions that we begin to make some approach to actuality. 
In order that the wave equation #0 = of a simple wave vector may be 
invariant for rotations of space-time, H must be a space vector. Corre- 
spondingly, if the wave equation of a double wave vector is // 2 *F = 0, // 2 

f Its application to the observed change of mass with velocity of an electron cannot be 
described as immediate, since the mass is distributed over an infinite wave front. To connect 
this with the change of mass of an approximately classical particle it is necessary to in- 
vestigate the transformation of the normalising conditions. 



176 Wave-tensor Calculus [10-9 

must be a space tensor of the second rank. If we divide H 2 into two parts 
H 2 ' + H referring to the two particles, H 2 ' and H% may or may not be 
tensors separately; but at any rate they will not be space vectors. Thus the 
vector form of H, which yields p Q = (m a -f jp 2 )*, is not applicable to the con- 
stituents of a double system; and p*/m occurs in (10- 982) in its own right as 
a component of a tensor of the second rank, not as an approximation to a 
component of a vector. 



PAET II 
PHYSICAL APPLICATIONS 



CHAPTER XI 
THE RIEMANN-CHRISTOFFEL TENSOR 

11-1. The Comparison Fluid. 

When the position of a body is determined by observation it is referred to 
landmarks furnished by material objects; but in theoretical formulae the 
position is supposed to be specified relatively to a geometrical frame. In 
macroscopic physics the geometrical frame is an admissible substitute for 
material reference marks; and there is no inconsistency in assuming that 
the position, momentum, etc., defined by reference to points of a geometrical 
frame, satisfy the same equations as the position, momentum, etc., measured 
from material landmarks. But it is not so in wave mechanics. A material 
frame of reference cannot be equivalent to a geometrical frame. For the 
material system is specified by a wave function which describes a pro- 
bability distribution of its observable landmarks relative to a geometrical 
frame. To identify the position and motion of the landmarks with the 
position and motion of a geometrical frame contradicts the uncertainty 
principle. 

Thus in so far as the equations of wave mechanics relate to anything 
observable, they contemplate (a) a geometrical frame, (6) a physical refer- 
ence object or system of objects, and (c) the particular particle or system 
under discussion. The geometrical frame cannot be omitted without aban- 
doning the whole method of wave mechanics, but it is only an intermediary 
and not the final reference system for our observations; (6) and (c) are neces- 
sary in order to furnish observable phenomena for measurement. For 
simplicity the casual reference objects (6) used in actual experiments are 
replaced by an ideal standardised reference object. The fundamental 
equations of physics necessarily contemplate highly idealised systems in 
highly idealised conditions; and they select a reference object with simple 
and symmetrical properties, just as they select for investigation very 
elementary systems (c). But the idealisation must not be carried so far as 
to substitute something which 'has not the same relation to our sensory 
experience. Idealisation is permissible; abstraction is not. The practice in 
current quantum theory of substituting a sharp geometrical frame for a 
probability distribution of observable landmarks is clearly indefensible. 

Probably it has been thought that the difference between (a) and (6) 
would be made insensible by using very massive material landmarks. But 
a massive system causes curvature of space. Current quantum theory 
neglects curvature of space, and therefore falls into error either way: either 
it postulates that a light reference object is used and wrongly neglects its 

X2-2 



180 Physical Applications [11-1 

uncertainty of position and velocity, or it postulates a heavy reference 
object and wrongly neglects the resulting curvature of space. Perhaps the 
most important insight, obtained through a combination of relativity 
theory and wave mechanics, is a realisation that the two alternatives are 
different forms of the same error. 

The magnitude of the error introduced by failing to distinguish (a) and (6) 
is a matter for detailed calculation, and will be found in due course. We need 
only say here that the common impression that it is likely to be trivial is 
very far from the truth. 

The idealised physical reference object, which is implied in current 
quantum theory, is a fluid permeating all space like an aether. Such an 
aether is in a sense a materialisation of space; or better, space (the geo- 
metrical background or metrical field) should be looked upon as a de- 
materialised abstraction of the adopted comparison aether. " Dematerial- 
isation" is represented analytically by replacing a probability distribution 
by a sharp configuration. The uncertainty principle does not allow such 
replacement until we have removed from the comparison fluid those 
characteristics which make it accessible to observation so that it no 
longer serves the purpose of a reference mark for observational measure- 
ments. 

We shall call the three constituents of any problem (a) the frame, (6) the 
comparison fluid, (c) the object system. 

In future developments frequent reference will be made to the com- 
parison fluid.t It must be understood that the comparison fluid is not a 
hypothesis. Still less is it an ascertained feature of the universe. The com- 
parison fluid is a datum of the elementary problems which we treat like 
the "frictionless constraints" so often specified as data in problems in 
elementary mechanics. There is no suggestion that a comparison aether 
actually exists throughout the universe. But, instead of a comparison 
aether, irregular distributions of matter exist and furnish landmarks for 
our observational measurements; the comparison fluid takes the place of 
these in our elementary equations. From these we can, if necessary, proceed 
to more complicated equations which take account of the irregularity of the 
background of our actual experiments. 

If anyone is disposed to offer criticism of this view of the equations of 
wave mechanics, it is necessary that he should state what (in his view) is the 
physical reference object intended by writers on wave mechanics when they 
mention positions, momenta, etc. For, if he believes the quantum equations 
to be true observationally, he must surely be prepared to state what observ- 
able system he believes them to be true of. We would remind him that a 

t The " standard environment*' occasionally referred to in earlier chapters is now replaced 
by the comparison fluid. 



11-2] The Eiemann-Christoffel Tensor 181 

geometrical frame is not observable; nor is it a simplified representation 
of anything observable ; its sharpness is, according to the principles of wave 
mechanics, incompatible with observability. 

The axiom of relativity is that we can only observe relations between 
physical entities. A relation implies two systems or two parts of a system; 
it can therefore occur only in connection with a double wave function, the 
duplicity corresponding to the two ends of the relation. 

It is therefore evident that the union of wave mechanics with relativity 
theory must be based on the theory of double wave tensors. As we pointed 
out in 10-75, the recognition that the simple wave tensor is an abstraction, 
and that the double wave tensor is the primary physical concept, is the 
counterpart in relativistic wave mechanics of the recognition in macro- 
scopic relativity theory that the interval is the primary physical concept. 

This conclusion is emphasised by the fact that the fundamental tensors in 
relativity theory are symmetrical tensors of the second rank, namely, the 
metrical tensor g^ v and the energy tensor T^ v . In symbolic notation these 
yield double wave tensors ^g^E^F^ and SIJ^JB^JJ,. On account of its 
antisymmetrical properties the Riemann-Christoffel tensor (although it is 
of the fourth rank) can also be expressed as a double wave tensor 



In the more elementary problems, the double wave tensors refer to the 
object system and comparison fluid, and specify the probability distribution 
of their combined configurations. What are ordinarily described as pro- 
perties of the object system must be understood to mean properties of the 
combination object system 4- comparison fluid. 

11*2. Linked Rotations. 

In general relativity theory, a vector A^ carried by parallel displacement 
round the perimeter of a surface element d8 va receives an incrementf 

dA^V^&lB^A'dS", (11-21) 

where B is the Riemann-Christoffel tensor. We have changed the usual 
order of writing the suffixes (which would have been B^ VOf ), since it is found 
to be very inconvenient in the present theory. We shall adopt local Galilean 
coordinates (natural coordinates); since real time is used, it is necessary to 
pay attention to the upper and lower positions of the suffixes. 

Let dS va be a circle of infinitesimal radius r in the plane x v x a , described 
in the direction x v ->x a . A vector A along the x axis, by receiving an incre- 
ment dAp along the x^ axis, is rotated in the x^x plane through an angle 



so that by (11-21) ^/= -iir 1 ^^. (11-22) 

t Mathematical Theory of Relativity, equation (84-1). 



182 Physical Applications [11-2 

The factor \ has disappeared because the surface element has two com- 
ponents dS va = -d/S ff " = 7rr 2 . 

The components of B therefore specify a linkage of two rotations, the one 
in the va plane and the other in the /*e plane, the first being a rotation of the 
position vector of the point considered and the second being a rotation of an 
arbitrary vector situated at the point considered. In physical applications 
we regard the point as occupied by an object system, and the rotations refer 
respectively to the position vector of the object system and to a vector 
regarded as intrinsic in the object system. We shall call the two vectors the 
position vector and the intrinsic vector. 

In ( 11-22) the angle of rotation of the position vector is va = 27T. The 
ratio of the linked rotations is therefore 



Or, expressing both angles contravariantly, 

0> /0"-= - \r*Bt" va . (1 1-232) 

In " de Sitter space-time ", i.e. space-time spherical in its space dimensions 
and hyperbolic in the time dimension, the Ricmann-Christoffel tensor is 

^%a = (W-S a MS/)/7? 2 . (11-233) 

This vanishes unless the planes /xc, va coincide. If /*, e = v, a the value is 
1/jR 2 . Hence, in de Sitter space-time the two rotations are in the same 
plane, and their ratio is 

e' vff /0 Vff = - ir 2 /JR 2 , (11-234) 

r being the length of the position vector (treated as infinitesimal) and E the 
radius of space-time. 

An illuminating point of view is obtained by interpreting 0' as the rotation 
of a comparison fluid. The intrinsic vector A^ has been parallelly displaced; 
this implies that, although its mathematical specification is altered, it has 
in some physical sense suffered no real change. It is still open to us to define 
the observational significance of the concept "real change"; this definition 
will decide the way in which the analytical theory of parallel displacement 
is to be utilised in the study of physical phenomena. We shall define real 
change to mean observable change. The orientation of the vector A in the 
geometrical frame is not observable; what we observe is its orientation in 
the comparison fluid which is the idealised substitute for material landmarks. 
Thus Ap will suffer no observable change if its direction remains fixed in the 
comparison fluid. Being fixed in the fluid it undergoes the same rotation 0' 
as the comparison fluid relative to the (unobservable) frame. 

The linkage of a rotation 0' of the comparison fluid to a rotation of the 
position vector of the object system is conceived dynamically as a recoil of 
the comparison fluid. If the fluid is assigned a moment of inertia 2w-ff 2 , 



11-2] The Riemann-Christoffel Tensor 183 

(11-234) expresses that when the object system is given an angular momen- 
tum mr^ddfdt the comparison fluid recoils with equal and opposite angular 
momentum 2mR 2 d6' jdt. In non-uniform space-time the comparison fluid 
has (as we should expect) a momental ellipsoid with unequal axes, so that 
the recoil angular velocity dO'/dt is not necessarily in the same plane as the 
recoil angular momentum, and therefore not necessarily in the same plane 
as de/dt. Thus components of ^ va for which /z, c ^ v, a are introduced. But 
we need not unduly stress this dynamical model. The R.C. tensor specifies 
directly the kinematical recoil of the comparison fluid corresponding to 
every possible cyclic displacement of the object system. If we introduce 
dynamical conceptions, we must attribute to the comparison fluid whatever 
dynamical characteristics are necessary to satisfy this specification, whether 
they are illustrated by familiar dynamical models or not. 

It may be noticed that a completely arbitrary connection between 0^' 
and v<r cannot be represented by an R.C. tensor; the arbitrary connection 
would have 36 disposable constants, whereas the R.C. tensor has 20 in- 
dependent components, t 

From the standpoint of macroscopic theory the foregoing is a highly 
speculative interpretation of the R.C. tensor. There is in fact an insuperable 
objection to adopting it in macroscopic relativity theory, which will be 
explained in 11 '3. But in microscopic theory we approach it in a different 
way; the speculative taint is removed; and the aforementioned objection 
disappears as soon as we substitute displacement by wave propagation for 
the classical conception of displacement. 

In wave mechanics the comparison fluid presents itself as a datum of the 
problem; without it the mathematical equations could have no relation to 
observable phenomena. We observe displacements of the object system 
relative to the comparison fluid; but the methods of mathematical physics 
require that we should use in our equations displacements relative to a 
geometrical frame. For convenience we call displacements relative to the 
frame absolute (i.e. conceived as absolute). We have therefore to analyse an 
observed relative displacement into absolute displacements of the object 
system and comparison fluid, before we can apply the theoretical equations. 
This partition can only be decided by convention. The convention is arbi- 
trary in the first place; but it may later be limited so as to fulfil conditions 
which simplify the resulting formulae. 

We have therefore to draw up a scheme of partition specifying the two 
absolute motions which correspond to any given relative motion; or equi- 
valently the scheme will specify the absolute motion of the comparison fluid 
corresponding to any given absolute motion of the object system. We have 

f We confine ourselves to Riemannian geometry. In affine geometry a generalised cur- 
vature tensor with 36 independent constants is substituted. 



184 Physical Applications [11-2 

seen that (for rotational displacements) such a scheme of linkage can be 
expressed by a R.C. tensor B^ V<r . 

Defined in this way as a scheme of partition, B^ va has no immediate 
relevance to curvature of space. But the analysis at the beginning of this 
section shows that, if we ascribe to the unobservable frame a metric whose 
B.C. tensor is B ftt909 the partition represented by B^^ becomes automatic 
on the understanding that displacement without observable change is 
represented geometrically by parallel displacement. "Observable", of 
course, means observable under the ideal conditions of the problem, namely 
that the only physical systems present are the object system and the 
comparison fluid. 

This double aspect of B^ va is the most essential link between relativity 
theory and wave mechanics. By it energy, momentum and mass, which 
appear in relativity theory as components of curvature of space-time, 
become identified with coefficients involved in the partition of relative 
displacements into absolute displacements, and therefore in the analysis of 
the double wave functions (which contain observational information) into 
the abstract simple wave functions which form the ordinary starting point 
of wave mechanics. In this way energy, momentum and mass get their 
footing in wave mechanics. 

The double interpretation of B^ va also exhibits the way in which the 
geometrical conception of mass as curvature becomes translated into a 
dynamical conception. The coefficients of the scheme of partition determine 
the recoil of the comparison fluid. An object particle whose displacements 
produce a large recoil is regarded dynamically as having a large mass. We 
see from.( 1 1-231) that if the B.C. tensor, and therefore the mass, is multiplied 
by any factor the recoil is multiplied by the same factor. 

11*3. Displacement by Wave Propagation. 

In obtaining (11-231) we assumed that the change dA^ recognised at the 
end of a complete cycle had occurred evenly during the cycle. In macroscopic 
physics it is impossible to admit this. When the object system receives a 
displacement dx , we do not know whether the infinitesimal displacement 
is going to form part of a circuit in the x^x^ plane or the x^x 3 plane, and it is 
therefore impossible to assign the plane of the recoil dO' of the comparison 
fluid. General relativity theory has therefore adopted a different represen- 
tation. It admits a rotation (usually accompanied by strain) of a vector 
parallelly displaced through dx\ but the rotation is described by a non- 
tensor quantity a 3-index symbol. The change assigned is thus dependent 
on the coordinate system employed; it cannot be pictured absolutely, or 
attributed to an objective rotation of the comparison fluid. 

This difficulty does not arise in wave mechanics, because we do not con- 



11-3] The Riemann-Christoffel Tensor 185 

template displacements of the classical type. The nearest approximation to 
classical displacement is propagation of a wave packet. But the wave packet 
is analysed into elementary waves, each of which fills the whole of space. 
An elementary displacement corresponds to the propagation of one of these 
waves. To each elementary wave there is a corresponding recoil of the com- 
parison fluid, and the recoil due to the propagation of the wave packet is the 
resultant of these. 

Wave analysis removes all ambiguity as to the plane of rotation, because 
it is a wave front, not a point, which is displaced. The free motion of a particle 
or wave packet is commonly analysed into "infinite plane waves " ; but these 
must be modified to fit finite space. In spherical space-time we can mark the 
wave front on the sphere, and obtain the wave propagation by rotating the 
sphere in the plane defined by the wave normal and the radius of curvature 
of space-time. For particles bound in an atom the analysis into elementary 
states gives waves representing angular motion; these correspond to 
rotations in three-dimensional space. 

It would be difficult to extend to all kinds of irregular space the con- 
ception of wave propagation as a simple rotation. In general the normal to 
space-time is six-dimensional. But wave mechanics evades all such com- 
plications by analysing physical systems into steady states in which the 
motion is characterised by constant angular momentum. Changes of 
distribution which cannot be represented by rotation (in the most generalised 
sense, i.e. displacement without intrinsic change in any of the planes of the 
symbolic frame) are treated by perturbation methods, and are not recognised 
as spatio-temporal displacements. If wave mechanics is unable to describe 
the change from configuration A to configuration B in terms of rotation, it 
represents the change as spatially discontinuous; that is to say, it examines 
the transfer of probability from A to B without representing the system as 
having passed through configurations intermediate between A and B. 
Such changes are called transitions, not displacements. 

The result is that every spatio-temporal displacement contemplated in 
wave mechanics is a rotation, or is analysed into rotations. Indeed the theory 
developed in the previous chapters makes no provision for displacements 
which are not the manifestation of rotation; and every continuous trans- 
formation or change of configuration is associated with a symbol defining 
a plane in a symbolic frame. Since we always know the plane of rotation 
associated with a displacement, the difficulty encountered in macroscopic 
physics does not arise. 

The obvious example in microscopic physics of an object system describing 
a small circle dS va is furnished by the electron in a hydrogen atom. We can 
contemplate the displacement of the electron during a time short compared 
with the time of revolution; but the displacement can only be described by 



186 Physical Applications [11-3 

an angle d0 va , not by a line element dx v , because the position angle of the 
electron at any instant is indeterminate. 

If then we consider an electron with constant angular momentum in the 
plane va 9 



the recoil of the comparison fluid in the plane vo will be, by (11-232), 



dt 
or, assuming spherical space, 

dg/ _ nh m.W 

"*""SS3P' (1133) 

It is desirable to meet at this stage the criticism that an effect of the order 
(11-33) is utterly trivial. The recoil dO'/dt applies to a comparison fluid 
filling the whole universe. There are, say, N other electrons in the universe, 
whose contributions to the total recoil must be added together. If (as seems 
to be generally implied in the elementary formulae) these are all supposed to 
be doing the same thing, the total recoil is 



where R' = R/VN-3 . 10" 13 cm. (11-35) 

according to the value of N found in Chapter xiv. This is by no means 
trivial; and outside the region of the nucleus (r > 10~ 12 cm.), dO'/dtis greater 
than d6/dt. 

If, on the other hand, we suppose that (11-31) refers to one electron only, 
it is right that the recoil (11-33) should be of the same order of observational 
insignificance as the motion which occasions it. To detect that one of the 
N electrons in the universe has acquired a rotation dO/dt can scarcely be said 
to be within reach of practical observation. It is true that we might observe 
a reasonably large effect, but the chances are N to 1 that we shall observe 
nothing; for there is nothing in the mathematical formulation to indicate 
that the electron referred to in (11-31) is the one which the observer is 
watching. 

In natural coordinates the continuum of space and imaginary time is 
isotropic. We have found that, for a spatial displacement of the object 
system, the comparison fluid recoils in the opposite direction; and the same 
holds for an imaginary time displacement. If we substitute real time, the 
sign of the corresponding components of the B.C. tensor is reversed; and the 
real time displacement of the comparison fluid is in the same direction as 
that of the object system. In our ordinary outlook, objects travel forward in 
time together. More particularly in a measurement involving an object 



11-4] The Riemann-Christoffel Tensor 187 

system and physical reference system, it is implied that both systems are 
observed simultaneously in the time reckoning adopted. We have seen that 
the scheme of linkage of displacements is initially an arbitrary convention, 
and for the present we shall treat it as arbitrary; but at the appropriate stage 
we shall specialise it so that the time displacements (in the time reckoning 
adopted) of the object system and comparison fluid are equal, as the 
ordinary outlook assumes. 

11-4. The Riemann-Christoffel Matrix. 

We shall now express the R.C. tensor as a wave tensor. In order to utilise 
the relation between wave tensors and space tensors found in our previous 
work, we must take the fourth coordinate to be imaginary time. Accordingly 
we use local rectangular coordinates with g^ = 8^"; and there is no longer 
any distinction between B* va and i^ tev<r . 

We define the Riemann-Christoffel Matrix to bef 



)y8- (11-41) 

Since the suffixes of the ^/-symbols run from to 5, we require components 
of Bp va additional to those which make up the ordinary R.C. tensor. The 
physical interpretation of the extra components will be found in due course; 
but their primary significance is that they specify linkages of the rotations 
of the object system and comparison fluid in the corresponding planes as 
in 11-2. 

Consider a factorisable matrix (as in (10-413)) 



Treating U and F as double vectors in the CD frame, we have by (10-24) 



By the symmetry property of the R.C. tensor, B^^^B^^. It follows 
from (11-41) that (-B) yj 3s=() y Sj8> so that, by (11-421), 

ZVfc-V* 

or UV =UV. This is satisfied if U and V are both symmetrical or both anti- 
symmetrical. We shall assume for the present that both are symmetrical; 
the antisymmetrical case is treated in 11-5. 

There is a simple generalisation of the identity (5-41), which extends it to 
any two four-valued quantities 0, #, namely 

<75 

S 



t It will be seen that the suffixes are so arranged that the B.C. tensor is represented in a 
CD frame ( 10*4). This has a slight advantage in certain later developments. 



188 Physical Applications [11-4 

This is proved in the same way as (5-41 ),f and holds for the six pentads given 
by M = 0, 1, 2, 3, 4, 5. Let tf ay ==0 aXy + Xa y . (11-441) 

Then (11-43) becomes 

{S a (J^ a ) a j8 (^ta)yS ~~ (^ie)a]8 (^leJyS) ^08 = ^' ( * 1*442) 

Multiplying initially by F ay and using (11-422), we obtain 

2<7JW-i6i8 = 0- (11-451) 

Also, multiplying initially by V y (J^ v ) a , we obtain 

SA/ur^O 0^")- (11-452) 

This last reduction depends on the rule(^? ft> ,) a (JS? /[1(r ) a jg = (E va ) p when a^ v.The 
term for which a = v reduces to V y ( 1)^ (E^)^ U^ , which cancels the last 
term V y (iE^)^ (i) y U^ , owing to the symmetrical property of U and V. 
We now express the results (11-451), (11-452) in terms of the Einstein 
tensor O^ v and invariant 0. Since ^ MV = 8^, 

? 4-7? 1 

p3 T -"^4^4 > / 1 i 4 i \ 

j- (11-4O1J 

We use the above definition of G^ v to define new components (? 56 and G^QQ. 
Then (11-451) becomes r 



n 1- 

V A A 

or, by the antisymmetrical properties oFB llVa9 

- G/iii + 5 5/i5,* + ^Ofio^t = ^leie (1 1-463) 

Summing (11-463) for /*,= 1, 2, 3, 4, we obtain 

0-055-0 w = 4fl 1616 . (11-464) 

But by (11-462) 



Hence, writing 6060 = JS 0505 = A (11-466) 

we obtain from (11-464) and (11-465) 

*i6i6 = !(#-2A). (11-47) 

Thus (11-451) and (11-452) become 



Whence, by (11-461), 

^v-^v(G f -2A) (11-48) 



by the usual relativity formula for the energy tensor T^ v . J 

t The reader who wishes to check the calculation may find it helpful to refer to a later 
result, (11-54), (11-551). 

J Mathematical Theory of Relativity, equation (54*71). The constant A is at our disposal, 
and its value must be fixed by a convention (see 13*1). Here the value of A is chosen 
so that the macroscopic energy tensor may have a simple connection with the energy, etc. 
defined in wave mechanics; it will, in fact, appear as we proceed, that the mechanical 
identifications adopted in current wave mechanics presuppose that energy, etc. are 



11-4] The Riemann-Christoffel Tensor 189 

If the axes 5 and are treated as invariant, (11-48) is a tensor equation, 
and therefore holds for any orientation of the axes in four dimensions. 

We have proved (11-48) for a factorisable R.C. matrix, but owing to the 
linearity of the equations the proof can be immediately extended to non- 
factorisable matrices, which must be expressed as the sum of a number of 
products UV in (11-421). As already stated, it is postulated that U and V 
are symmetrical matrices; when they are antisymmetrical some additional 
terms appear which will be investigated in 11-5. 

By (11-48) the energy tensor is composed of two parts, which we shall 
distinguish as a kinematical part (T^) k and an electrical part (T^ v ) e , namely, 



Considering the kinematical part, the component T^ v determines the rota- 
tion of the comparison fluid in the /x5 plane linked to a rotation of the position 
vector of the object system in the v5 plane. These are the rotations which 
correspond to translations in space-time (6-2). Accordingly: 

The kinematical energy tensor consists of those components of the whole 
Riemann-Christoffel tensor which specify linkages of translations in four 
dimensions. 

It is this part of the tensor which is used in dissecting a relative motion of 
translation into separate absolute motions of the object system and 
comparison fluid. 

We now see why this part of the R.C. tensor is sufficient for macroscopic 
mechanics. The " continuous matter" treated in macroscopic relativity 
theory is supposed to be such that.its kinematics can be described by macro- 
scopically continuous displacements in four dimensions. The theory makes 
110 provision for microscopic vorticity. Atomic physics, on the other hand, 
is intimately concerned with vorticity or angular momentum on a micro- 
scopic scale. Relativity theory can take account of microscopic motions 
(e.g. heat energy of a gas), but only as signless quantities mean-square 
values. If the atoms were spinning preponderantly in one direction, there 
would be no indication of this in the motions as averaged for macroscopic 
purposes. 

Thus the mechanics of the atom cannot be treated on the basis of the 
energy tensor T^ v alone. The complete R.C. tensor B fAva9 including com- 
ponents with suffixes and 5, plays the part of a general mechanical tensor 
applicable to atomic as well as macroscopic problems. The macroscopic 
energy tensor forms part of it, B^ v6 . In atomic problems it is generally 



measured from the zero defined by the values of A given here and in 11*5. In cosmical 
problems the value is fixed by another independent convention which has secured general 
recognition; this cosmical value will not necessarily agree with the value here used, as we 
shall find in (11-592) and (11-593). 



190 Physical Applications [11-4 

more convenient to handle the complete tensor in its wave tensor form, 
i.e. as the R.C. matrix (B)^^. 

It has always been difficult to see the physical significance of the process 
of contraction used in obtaining the energy tensor from the B.C. tensor. 
We have no physical insight into what we are doing when we add together 
^1212 + ^1313 + ^1414 * f rm ~~ ^11 We now see that from the physical point 
of view the energy tensor T n is not specially related to this sum, but is 
another component #1515 of the same tensor. The summation is part (but 
only part) of a process of calculating T u when other components are known 
depending on an identity satisfied by the components. But equally jB 1212 could 
be calculated from the same identity if the other components were known. 

In the generalised field theoryf I have derived the fundamental tensor 
*Bp Va from the theory of relation structure. Since only a portion of the 
tensor (viz. the part preserved in the contracted tensor *O^ V ) was actually 
used in specifying the electromagnetic-gravitational field, we seemed to 
have " dragged up from below a certain amount of apparently useless 
lumber" (loc. cit. 99). It was natural to hope that the unused part might 
ultimately be needed in the representation of microscopic structure; 
though at the time 110 progress could be made in this direction. We now see 
that this hope is fulfilled, and the full tensor *-B /4W e is utilised in microscopic 
theory. (*/^ lV(7 differs from B^ va in providing for a macroscopic electro- 
magnetic field, which would be an unnecessary complication at this stage 
of our investigation.) There is accordingly thorough continuity between 
the present theory and the field theory developed in my earlier book. 

Field theories which are based on "five-dimensional relativity", i.e. 
theories which introduce a curvature tensor in five dimensions, have not 
so simple a connection with the present theory. B^ va has the double 
aspect of a partition tensor and a curvature tensor; as a partition tensor 
it includes components corresponding to two additional suffixes 5, 0; but 
the identification with a curvature tensor applies only to the suffixes 
1, 2, 3, 4, and in the present theory we do not extend this identification. 
For such an extension it would be necessary to introduce a fictitious 
extension of the distribution over a fifth or sixth dimension, so as to 
provide corresponding Christoffel brackets and components of curvature. 
Even if this were formally possible, it would be a retrograde development, 
undoing the advance made in 6-3, 6-4. Whether the number of suffixes 
is four or six, B^ va is a function of four coordinates only. It is to be 
remembered that wave mechanics is a statistical theory. If we introduce 
fictitious dimensions, the generalised formulae will determine steady states 
in five or six dimensions ; but these are not the states with which we are 
concerned in physics. 

t Mathematical Theory of Relativity, Chapter vn, Pt. II. 



11-4] The Riemann-Christoffel Tensor 191 

As an elementary example, let the object system be a particle with an 
exact momentum. Let us choose the time direction so that it is at rest. Its 
energy tensor then reduces to a single component T^p^ where /> is the 
proper density of its probability distribution. By (11-231) a time displace- 
ment 80 46 of the object particle corresponds to a time displacement 80 45 ' of 
the comparison fluid, given by 



S0 4 5. (11-495) 

For any other direction of displacement SO' = 0. 

It may seem surprising that in this case the displacement of the particle 
in spatial directions causes no recoil of the comparison fluid. But it must be 
remembered that the conditions are highly idealised. By taking the momen- 
tum vector to be exact the position of the particle becomes entirely unobaerv- 
able. Therefore the problem of analysing an observed spatial displacement 
into absolute displacements of the object particle and comparison fluid does 
not arise. To represent observed position in space we must introduce a wave 
packet; the momentum then ceases to be wholly in one direction, and we 
cannot choose the time direction so that T^ v reduces to a single component. 
There will accordingly be components T n , etc., specifying the way in which 
the observed spatial displacement is to be partitioned. 

Returning to a particle with an exact momentum vector, the particle is 
distributed with even probability over the wave front of its waves. Dis- 
placement in the wave front is merely a transfer of our attention from one 
point to another, which has no dynamical reaction on the comparison fluid. 
Displacement normal to the wave front (relative to the frame) is a physical 
change of the conditions, involving a linked displacement of the comparison 
fluid. We see from (11-495) that, when the object particle moves forward in 
time, the comparison fluid moves in the same direction. We have pointed 
out that the usual outlook, which has become incorporated in the equations 
of physics, requires that an object system and its physical reference objects 
should always move together in time that the reference system should be 
a simultaneous one.f We have therefore the condition 

.ffS0 45 ' = r80 45 , (11-496) 

where J280 45 ' is the displacement of the comparison fluid in linear measure. 
By (11-495) and (11-496) ^ y^ Er (u . 497) 

When we said above that a time displacement of the particle was a physical 
change of the conditions, we regarded the displacement as being applied to 

f This refers primarily to coordinate time, and so depends on the choice of coordinates. 
But, in the present simple example, if the two systems agree in coordinate time they will 
agree in proper time. 



192 Physical Applications [11-4 

the particle and not to anything else just as one regards space displace- 
ments. But it is a rooted habit of thought that a time displacement applied 
to one particle automatically applies to the rest of the universe so that 
time displacement also is a mere transfer of our attention from the present 
to a future moment. We here reach the same outlook in a more legitimate 
way by so choosing the linkage of displacement that the object particle 
automatically carries the comparison fluid with it in the time direction. 

As another elementary example, suppose that the object system is 
identical with the comparison fluid; so that the object is its own standard of 
reference. Since the object cannot change relatively to its simultaneous self, 
it will be described observationally as uniform and unchanging. But a 
uniform unchanging distribution of matter is an Einstein universe. Setting 
80 45 ' = 80 45 in (11-495) to express that the object is always compared with 
its simultaneous self, we have =1/4777-2 (11-498) 

which is a well-known formula for the density in an Einstein universe of 
radius r. 

1 1-5. The Dual Riemann-Christoffel Tensor. 

By 10-4 the R.C. matrix can be resolved in alternative ways. Let 

(*)^ = Sfi^(^ (11-51) 

or, in the notation of 10-4, 

^. (11-52) 



We call b^ va the dual R.C. tensor. 

An interesting case is when the dual B.C. tensor consists of a single 
component 6 1616 = 6. Then, by (10-494), 

(B) = bE l sF u =bX(C D ) 9 (11-531) 

the minus sign applying to time-like matrices. Hence, by (11-52), 



the minus sign applying when E^ is time-like. 

Our first impression is that (11-532) is the R.C. tensor for de Sitter space- 
time (11-233), whose quadratic curvature Jt~ 2 is positive for space dimen- 
sions and negative for time dimensions. But in our present coordinate 
system a: 4 is imaginary time; the ambiguity of sign in (11-532) is eliminated 
when we substitute real time rotations for imaginary time rotations; so that 
(11-532) represents a world completely isotropic in space and real time. 

De Sitter space-time is obtained by taking the dual R.C. tensor to consist 
of a single component 64545=6; so that 

(B) 6^45^45 - J6S ( CyU (1 1-533) 

the minus sign applying to a = 01, 02, 03, 04, 05, 16 by (10-495). The plus 
sign applies to all the components of the ordinary four-dimensional R.C. 



11-5] The jRiemann-Christojfel Tensor 193 

tensor; these accordingly agree with the values for a de Sitter space-time of 
radius given by J6 = -R~ 2 . 

By (11-421), (11-51) and (11-533) 



Thus U and V in this case consist of single components ^45^45, $45^45, 
(^45^45 = b). Since J^ 45 is an antisymmetrical matrix, U and F are anti- 
symmetrical. 

Before proceeding further with the investigation of de Sitter space-time, 
it is necessary to extend the theory given in 11-4 which treats only the 
symmetrical case. Let 



According to (11-43) Q OLY = # ya . Using the standard matrices (3-27) for 
the pentad E 0a , we find by direct calculation that |(? ay is the matrix 

p"* &sXi-<AiX2 ^2X4-^4X2 ^4X1- ^1X4 

* ^1X2-^2X1 ^3X2-^2X3 0iX3-0sXi 

^4X2-^2X4 ^2X3-^3X2 ^4X3-^3X4 



This is equivalent to 



where ?7 = -- Since 



the result is iQ ay =^ ay -H^45)cjf 7 cJ-(^45)a y - (11-551) 

If T is another antisymmetrical matrix, we obtain a corresponding R.C. 

matrix ( J B).^-C^^W. X ^+ K^A-^o)^^. (H'M2) 

Since ?7 and F are both antisymmetrical, (5) ay 0s ^ s antisymmetrical in a 
and y and in j8 and 8. Hence, by (11-51), & Mwr = unless jE7 Me and JSf w are both 
time-like. The dual R.C. tensor thus has at most 36 components. We have 

U ay F ay = S b uva (E^ (E va ) ay = S b^ spur (JE^ J w ) 

= 426^^ = 47, (11-553) 

where 7 = 6 2 323 + 6 3isi + ^1212 + ^0404 + ^osos + ^4545 (1 1-554) 

Also we have 

) e U l (^ 46 ) ay F ay = S b vuno spur ( J 45 ^ ) spur ( 



Hence, setting j8,8 = a,y in (11-552), 

(11-56) 



194 Physical Applications [11*6 

The term 6 4646 is distinguished by the fact that J0 46 is the product of the two 
time-like matrices in the pentad E 0a . This rule enables us to write down the 
corresponding term for other pentads. 

The left-hand side of (11-56) is the quantity which was treated in 11-4, 
yielding the equation (11-451). In the symmetrical case it vanishes identic- 
ally. The right-hand side of (11-56) therefore gives the additional terms 
which must be included in (11-451) when U and V are antisymmetrical. In 
place of (11-462), we have the following equations for the diagonal com- 

ponenteof^ -0 11+ 1616 + 1010 +2& MM = 5 1616 +r (11-57) 

~ #22 + ^2625 + -B2020 + 2&3131 - JS 1616 + 7 

- <?33 + fi 3536 + -B 3030 + 26 1212 = B uu + T 
+ 4049 + 26 0806 = B uu + Y 

+ 2&0404 - #1616 + 7 
-G w + B OS06 + 26 4546 = 5 16ie + 7 

Adding the first four and subtracting the last two, we have 



So that, setting 5 1616 + F = i (0-2A), (11-581) 

we obtain A = # 0505 + 26 0404 + 26 464B - Y. (11-582) 

The result differs from the symmetrical case in two ways: (1) the value of 
A is changed, and (2) the energy tensor G flv %g tJiV (G 2X) now has three 
constituents. If T^(T flv ) k + (T^ v ) e + (T^) a9 (11-583) 

the four diagonal elements of the new constituent are 

(^)a= ( - I/**) (&2323 , ftilll. &1212, &0505)- (H'584) 

The non-diagonal elements will contain the non-diagonal terms of b jtva . 

We return now to the dual R.C. tensor with a single component 
6 4545 = b = 4jR~ 2 , which gives de Sitter space-time. Then (11-584) vanishes, 
and the energy tensor consists of kinematical components #^5 and elec- 
trical components U^ovo as * n ^ e symmetrical case. Here we have B^^ = E~ 2 
and B^Q= - JR 2 , by (11-533); so that T^ v = 0, as it should be in a de Sitter 
"empty" world. By (11-582), A = 35~ 2 , which agrees with the usual result.f 

The "emptiness" of the de Sitter world is here represented as due to a 
cancelling of two energy tensors (Tp V ) k and (T^. This is because the sim- 
plest kind of space is an electrically saturated space. To obtain a metric 
agreeing with that of neutral space we have to cancel the electrical energy. 
It is therefore appropriate that the formulae should exhibit the zero energy 
as the result of cancellation. Another way of obtaining the de Sitter world is 
to take a dual R.C. tensor with two components 6 4546 = 6 0404 = 6. Since the 
suffixes and 5 are interchanged in the two components, the four-dimen- 

t Mathematical Theory of Relativity, equation (69-12). 



11-6] The Riemann-Christoffel Temor 195 

sional part of B /[4eva is unaltered; but the energies B^^ 9 B^^ are found to 
be zero. 

An Einstein world is obtained by taking a dual R.C. tensor with two 

components 64545 = &0505 = 2 ^" 2 - (11-591) 

We then find # 2323 , -B 3 m B m2> S i6i5> B *m> ^3535= B ~ 2 ' 

#0404* ^1618 = ~^~ 2 - 

The other components vanish. These agree with the values in a space-time 
spherical in three dimensions and cylindrical in the fourth, namely /? 2 323> 

^3131 ^1212 =^~ 2 J -Bl414 ^2424 > ^3434 0- ^ Or ^ e ener gy tenSOr W6 obtalll 

(remembering that there is a component (244)0= (V^^osos) 

87ry^ = (-l, -1, ~1, -3)B-a. (11-592) 

By (11-582), A = 0. This is a possible specification of an Einstein world, 
though it is not the one which has usually been given in general relativity 
theory. The energy tensor ordinarily adopted isf 

' = (0, 0, 0, - 2) JB-2 (11-593) 



where A' = R~ 2 . That is to say, ( 1 1-593) is obtained from ( 1 1-592) by changing 
the constant A (the cosmical constant) in the formula 



In other words we reckon energy and pressure from a different zero. This 
change of zero reckoning will be explained in 14-3. 

11-6. Metric and " a priori" Probability. 

Let fcr^Jo- -(fl^- ty.,0), (H-611) 

87r(^ v ) c = A^ v , (11-612) 

so that ^ = (^)o-(^)c- (11-613) 

We shall suppose tentatively that (T^ and (T^ are the absolute energy 
tensors of the object system and comparison fluid referred to the frame. J 
The ordinary relative energy tensor T^ v of the object system is thus the 
difference of the two absolute tensors. 

In macroscopic relativity theory g^ v is defined by the condition that 
SpvdXpdXy ( =efo 2 ) is a quantity measurable in a definite way by scales and 
clocks. This definition is clearly unsuitable for microscopic theory. We must 
adopt a definition of g^ v in terms of more primitive conceptions, and show 
later, when the theory is sufficiently developed to treat the extension of 
macroscopic aggregations (scales and clocks), that the macroscopic definition 
is equivalent. We therefore regard (11-612) as the definition ofg^. 



t For real time the last component becomes 87rT 44 '= +2 R~* 9 giving p 

{ The absolute energy tensor referred to the frame (self -energy tensor) is defined in 10*5. 



196 Physical Applications [11-6 

Apart from a numerical scale factor A/STT, the metrical tensor g^ v is simply 
the energy tensor of the adopted comparison fluid. 

We have seen that the geometrical frame of coordinates must be filled 
with a comparison fluid in order to form an ideally observable background 
of reference for the object systems that we study. The comparison fluid 
consists of a probability distribution of particles which might be specified 
in detail; but macroscopically it is sufficiently described by specifying its 
energy tensor (relative to the frame) as a function of the coordinates. This 
energy tensor constitutes a metrical tensor for the frame and is more 
familiar to us under that name. Having thus defined g^ v9 we can transform 
the coordinates locally to rectangular coordinates. Then (11-612) becomes, 
on raising a suffix, (r/) c = (A/87r) S/. (11-614) 

That is to say, the comparison fluid turns out to be uniform and isotropic at 
every point so far as the gross characteristics described by the mixed 
energy tensor are concerned. We do not choose an isotropic comparison fluid . 
Whatever comparison fluid is chosen turns out to be isotropic, because, 
being the comparison fluid, it is the standard of isotropy . 

In elementary problems we consider only one or two particles. It is not 
intended that there shall be no other particles in the universe. It is implicitly 
assumed that there is some innocuous way of distributing the other particles 
so that the results of our investigation will not be entirely invalidated by 
their presence. Formally what is known as "the problem of two particles" 
should be described as the problem of N particles, of which two are studied 
in detail whilst the remaining N 2 are supposed to have some standard 
average distribution which remains undisturbed by the behaviour of the 
two particles. We shall call the N 2 particles unspecified particles. A par- 
ticle is specified by giving it a wave function describing a particular pro- 
bability distribution of position or momentum; so long as its probability 
distribution is the standard average distribution there is no need to mention 
it in the problem. 

It would be redundant to surround the object system with a double 
environment of (a) unspecified particles waiting to be introduced into the 
analysis as the treatment grows more comprehensive, and (6) the particles 
constituting the comparison fluid. We therefore adopt a comparison fluid 
consisting of all the unspecified particles in the universe. Let the number 
be N. It is convenient also to define a partial comparison fluid having 1/^th 
of the density of the actual comparison fluid, and therefore corresponding to 
the standard probability distribution of one unspecified particle . The absolute 
energy tensor of a partial comparison fluid is, by (11-612), 

(^-(A/SidOfiW- (11-615) 

Whilst it is permissible to speak of the energy tensor T^ v belonging to a 



11-6] The Riemann-Christoffel Tensor 197 

particular object particle, we must not use the formulae equating it to 
curvature of space-time, unless the object particle is the sole occupant of 
the region considered. Since the probability distributions of a number of 
object particles may overlap, we must write (11-611) more generally as 

87^(2^)0= -(G^-^,0). (n-621) 

Here S extends over all the particles in the universe treated in turn as 
object particles, although in practice the main contribution comes from a 
few particles specified as being present in the region we are considering. 
(The combined density in any region of all the unspecified particles is 
probably less than 10~ 28 gm. per cu. cm.) 

We have therefore for the whole energy tensor in any region 

2 I ^=S(r^-(r M>p ) e L{(2'^-(2' M ,y, (11-622) 

there being N terms in the summation. Thus each object particle has the 
relative energy tensor (T ^. (T ^. (11 . 623) 

When an object particle is unspecified, it has the standard probability 
distribution, and its absolute energy tensor (IJ^Jo is equal to (T^ . Thus 
its relative energy tensor vanishes. It follows that for a specified object 
particle: 

The relative (observable) energy tensor is measured from the standard 
probability distribution as zero, and not from a hypothetical state of noii- 
existence. The conception of creating a particle does not enter into our 
theory. Energy is furnished to a particle by specifying it, not by creating it. 

The result (11-623) is the justification of the tentative procedure which 
we have been following in this section. It is necessary in order to conform 
to the general outlook of wave mechanics, which allows us to discuss a 
system of a few particles without explicit recognition of the vast number of 
other particles in the universe. If T^ v were merely a tensor involved in the 
inner mechanics of the system, it would not matter much from what zero 
it was reckoned. But in unified theory T^ is to be identified with the 
relativity tensor (-l/87r){G [ flJ/ -^ lf (G-2A)}; and for this it is essential 
that all the particles in the region, specified or not, shall be included. If in 
the formulation of the " problem of two particles" it is intended that there 
shall be no unspecified particles overlapping the system, this should be 
stated explicitly; the appropriate terms must then be introduced into the 
equation to represent the energy employed in creating a vacant space 
among the JV - 2 unspecified particles; in fact the unspecified particles must 
be specified as absent from the region. Since this is not the procedure 
followed, we must adopt the alternative given by (1 1-623), namely we must 
choose the zero of energy in such a way that the unspecified particles make 
no contribution to 2^'BO that we may ignore them without harm. 



198 Physical Applications [11-6 

Our results up to this point are: 

(a) The adopted comparison fluid consists of the unspecified particles of 
the universe. 

(6) The absolute energy tensor of the comparison fluid is the A-term 
(generally known as the cosmical term) in the relative energy tensor. 

(c) The absolute energy tensor of the comparison fluid is the metrical 
tensor of the frame. 

If a different comparison fluid were used it would be necessary to 
"specify", either individually or macroscopically, the particles .composing 
it. The macroscopic reference objects used in particular observations can 
be regarded as a specified comparison fluid. But it is simpler to treat these 
as additional object systems; so that the same comparison fluid (a) is used 
as intermediary in all problems, and the observed relations between object 
systems are resolved into relations (unobserved, but nevertheless ideally 
observable) between each object system and a common comparison fluid. 

The probability distribution which applies in the absence of special 
information is commonly called the a priori probability distribution. The 
unspecified particles are those as to which we have no special information. 
The probability distribution of an unspecified particle, which we have 
identified with a partial comparison fluid, is therefore the a priori pro- 
bability distribution. The identification 

A priori probability distribution = partial comparison fluid 
helps to connect our theory with current terminology; but it must be under- 
stood that we reject emphatically the conception of a priori probability. 
The distribution is uniform and isotropic not for a priori reasons, but for 
the reason already given, namely that it determines the tensor g^ v . It is, 
I think, generally admitted that the need for a conception of a priori 
probability is a logical weakness in statistical mechanics as ordinarily 
developed. We need not discuss here attempts to defend it on a metaphysical 
basis, and to deduce from a ' ' Principle of Indifference ' ' that it has a uniform 
and isotropic distribution. Probably few theorists would accept such views 
today. But it is not generally realised that relativity theory has rendered 
the conception of a priori probability entirely unnecessary ,f 

The so-called a priori probability distribution used in statistical mechanics 
is essentially an unobservable; for as soon as we have any observational 
information, the a priori probability ceases to apply, and a modified (actual) 
probability is substituted. A priori probability is to be treated therefore as 
other unobservables (introduced for mathematical convenience) are treated 
in relativity theory, e.g. frames of space and time. The a priori probability 
distribution is sometimes called a "basis of statistics"; that is to say, it is 
f New Pathways in Science, pp. 129 ff. 



11-6] The Riemann-Christoffel Tensor 199 

a standard of reference for statistical enumerations in the same way that a 
frame of space and time is a standard of reference for measurements of 
extension. Observable results are invariant for all transformations of these 
reference frames; they must equally be invariant for changes of the adopted 
a priori probability distribution. The common appellation is therefore a 
misnomer; the a priori probability distribution is not anything that is 
given a priori; it is an arbitrary comparison distribution which we can change 
as we please. 

The reason why it is not ordinarily recognised that observable phenomena 
are invariant for transformations of the a priori probability distribution, is 
that the transformation is not pursued to the end. If the change of reference 
distribution changes the equations of an electron and proton, it will change 
also the equations determining the behaviour of the measuring appliances 
composed of electrons and protons. When the transformation is applied 
both to the measuring appliance and the object measured, the result of the 
measurement is invariant. But if we apply the transformation only to the 
microscopic systems which are being measured and not to the macroscopic 
measuring apparatus, we fail to notice this invariance. That was the mistake 
originally made in interpreting the Michelson-Morley experiment; the 
transformation representing change of motion was at first applied only to 
the optical factors in the experiment, the mechanical or metrical factors 
being overlooked. 

This invariance is provided for in the usual way by a tensor relation 
between the quantities concerned in the microscopic and macroscopic parts 
of the theory. In the microscopic equations the change of a priori pro- 
bability is formulated as a change of the energy tensor (3^^) c of the com- 
parison fluid; in the macroscopic equations (which determine the behaviour 
of the measuring appliances) the same change is expressed as a change of 
the metrical tensor g^ v . 

The linkage of the changes of g^ v and (T^ c , so as to produce no observable 
result, is provided for by their description as tensors of the same kind. In 
(11-612) we have gone further, and identified them. The importance of this 
step is that if the frame and the comparison fluid are both described by the 
same tensor called in one case the metrical tensor and the other case the 
energy tensor it minimises the error of current quantum theory in con- 
fusing them. To give meaning to the ordinary equations of current quantum 
theory we have to suppose that a particular physical reference object is 
intended, and we have to discover what this reference object is. If, on being 
challenged, the quantum physicist were to reply that he intended the 
reference object to be an atom of carbon travelling at 150 km. per sec. in 
the z-direction, it would be preposterously misleading not to have stated 
this. If, however, he replies that he has not distinguished between the frame 



200 Physical Applications [11-6 

and the comparison fluid, so that when he said that the tensor g^ v of the 
frame was isotropic (i.e. that his coordinates were rectangular) he also meant 
that the comparison fluid was isotropic, his omission is a minor inadvertence. 
He might perhaps accuse us of stupidity if we did not supply the omission 
for ourselves. Having regard to this, our identification of g^ v with the energy 
tensor of the comparison fluid is an obvious rectification of an omission in 
current quantum theory. 

Locally any change of (2^ v ) c or g^ v is a tensor transformation; but over an 
extended region we have to distinguish between tensor transformations 
and absolute changes. From the second derivatives of the g^ we obtain 
certain characteristics of the comparison fluid, which we shall call hyper- 
characteristics, which must be specified definitely in any problem and must 
not be changed in any transformation. These hypercharacteristics corre- 
spond to irreducible gravitational fields, or to the distributions of matter 
causing such fields. Either an assurance that the field is zero, or a specifica- 
tion of the invariants of the field, is an essential datum of any problem con- 
cerning an object system which covers an extended region. The observable 
results are not invariant for transformations of the probability distribution 
of the unspecified particles which would create a change of irreducible 
gravitational field; and when, as usual, the comparison fluid is not mentioned 
explicitly as a datum of the problem, it is obviously implied that its cha- 
racteristics are not incompatible with the other data furnished, that is to 
say with the invariants of the gravitational field. 

Accordingly the statement that any distribution may be chosen as an 
a priori probability distribution requires qualification when an extended 
region is considered. It must be remembered that we are not defending 
a priori probability. In our theory the partial comparison fluid takes its 
place. Therefore we are not led into a discussion as to what kind of field 
should be assumed as probable a priori, if no observational information is 
furnished. If we are not told what the field is, we have no idea what will 
happen, or what is likely to happen and there is no more to be said. 

By ( 1 1-623) the unspecified particles of the universe make no contribution 
to Tp V , and can therefore be neglected in elementary problems at any rate 
so far as this part of the theory is concerned. But there are other ways in 
which the treatment of a system of a few particles as if they were the only 
particles in the universe is liable to be misleading. It is natural to discuss 
the effect of spatial displacement of a single particle, the other particles in 
the universe being unchanged. It is possible also to discuss a temporal 
displacement of a single particle, the other particles being unchanged. This 
conception is used frequently and legitimately in relativity theory; indeed 
it would be contrary to the fundamental principle of relativity theory to 
discriminate between space and time displacements. But in quantum 



11-7] The Riemann-Christoffel Tensor 201 

physics the notion of temporal displacement has been employed less care- 
fully. Usually when reference is made to a displacement (da?, dy, dz, dt) of 
a particle, it seems to be intended that (dx, dy, dz) applies to that particle, 
and dt applies to all the particles in the universe. Obviously, when this 
interpretation is intended, Lorentz transformations are not applicable to 
(dx, dy, dz, dt). Thus the common form of the "problem of one particle" 
combines spatial displacement of one particle with temporal displacement 
of N particles. The "one" particle is thus a hybrid whose time displace- 
ments are N times as effective as its space displacements in producing recoil 
of the comparison fluid; and the corresponding Riemann-Christoffel tensor 
has its time component 4545 magnified N times relatively to its space 
components B 1515 , etc. The corresponding radius of curvature is therefore 
R in the space directions, but R' = R/^/N in the time direction. Since N is 
about 10 79 , the geometrical picture has become too extravagant to be 
retained; but we express the same analytical relation by ascribing to the 
intrinsic vectors of the object particle the appropriate rotation 80', linked 
by .64545 to the rotation 8tf R of its position vector in space-time. This very 
rapid rotation of the intrinsic vectors (e.g. the stream vector) is interpreted 
as a periodic wave. The wave length should be 27TJR', which is of order 
1C" 12 cm., and is comparable with the wave length of the de Broglie waves 
of elementary particles. The precise calculation of the wave lengths for 
protons and electrons involves additional considerations which will be 
treated in due course. 

The waves attributed to an electron in current wave mechanics belong to 
it, not individually, but because it is one of N particles; and their wave 
length is dependent on the number N. Considered by itself, a particle has 
no other periodicity than that due to the connectivity of the space in which 
it is situated. The wave length of its "waves" is the circumference of space. 

11-7. Microscopic and Macroscopic Theory. 

In the most general formulation of the problems of physics the particles are 
divided into three groups: 

(a) One or more particles specified individually. 

(6) Collections of particles specified macroscopically. 

(c) The "rest of the universe " consisting of unspecified particles. 

In mathematical treatment the subject is divided into a number of idealised 
problems in which one of the groups may be absent. 

In general the particles of group (c) far outnumber all the others, and most 
of our formulae are developed on this understanding. There is, however, a 
class of problems, namely, cosmical problems, in which the whole universe 
is specified macroscopically, e.g. an Einstein universe; there are then no 



202 Physical Applications [11-7 

particles of group (c), and special precautions must be taken in applying the 
formulae (Chapter xiv). 

By (11-623) the energy tensor of the unspecified particles vanishes. But 
there is no such condition for the other components of their B.C. matrix; 
and the four-dimensional R.C. tensor of the unspecified particles is the 
foundation of the R.C. tensor of space-time. We shall call this the cosmical 
E.G. tensor. If it exists alone it gives a de Sitter space-time; for that is well 
known to be the only solution (without singularities) of the equation 
a tlv -$g fJLV (G-2X) = satisfied by space-time containing only unspecified 
particles. 

Notwithstanding the overwhelming preponderance of unspecified par- 
ticles, the effect of group (6) is often more important. This is because the 
effect is more localised. As part of their macroscopic specification, the 
position of these collections of particles is closely defined; and they con- 
tribute an R.C. tensor which locally far outweighs the cosmical R.C. tensor. 
Consequently the latter is commonly neglected in the local problems of 
general relativity theory, e.g. the theory of planetary motion. 

Quantum theory deals primarily with problems involving (a) and (c) only. 
The influence of macroscopic objects, if any, on these problems is attributed 
to gravitational or electromagnetic fields emanating from them. We may 
therefore call problems which do not involve group (6), directly or indirectly, 
field-free problems. In field-free conditions space-time is of the de Sitter type. 
The specified particles (a) are represented by wave-functions, and the un- 
specified particles (c) are represented by the cosmical R.C. tensor, or 
equivalently by the g^ v which correspond to it. 

It might be thought that the individually specified particles, like those 
of group (6), would contribute a-local R.C. tensor very much greater than 
the cosmical tensor. For example, the average density in the space occupied 
by an atom is very much greater than the density ( ~ 10~ 28 ) due to un- 
specified particles. But this is a confusion of ideas. We must make up our 
minds whether the R.C. matrix of a particle is to be absorbed into the R.C. 
tensor of space-time, or reserved for more detailed treatment by quantum 
theory. We cannot have it both ways. A particle specified individually is one 
of those reserved for detailed treatment. Its interaction with its surround- 
ings is expressed by wave functions. We must not at the same time insert 
a duplicate in the R.C. tensor, and so obtain another (cruder) representation 
of its interaction by the changes of g^ v which would result. 

We shall find later that the wave functions of a specified particle, employed 
in current theory, are defined in such a way that they represent an addition 
to, not a substitution for, the characteristics belonging to it as an unspecified 
particle. In that sense the specified particle is represented in the R.C. tensor 
as well as in the wave functions. But that merely secures that the cosmical 



11-7] The Eiemann-Christoffel Tensor 203 

tensor is not disturbed when the particle is specified. If it were not for this 
provision, the R.C. tensor of space-time would be reduced by about 1 part 
in 10 79 for every particle specified. As it is, we may regard the number of 
unspecified particles as a fixed constant N, giving a constant B.C. tensor 
however many particles may be specified individually. 

It is important to realise that the individual atoms and electrons, whose 
energy and momentum are given by their wave functions, do not disturb 
the curvature of the space-time in which their wave functions are represented . 
Reciprocally the macroscopic objects, whose energy and momentum are 
represented by components of curvature of space-time, have no wave 
functions.f For the purpose of investigating the connection of microscopic 
and macroscopic theories, we can represent macroscopic objects by com- 
plicated wave functions; but such wave functions occupy a space-time which 
has no more than the cosmical curvature. In 11-1 we referred to the error 
of confusing the frame with the physical comparison system, and remarked 
that the neglect of uncertainty of position and momentum of light com- 
parison objects and the neglect of curvature of space-time produced by 
heavy comparison objects were different forms of the same error. The error 
takes one form or the other according as the comparison objects are specified 
microscopically or macroscopically. 

Using the notation of 10-4, the R.C. matrix can be expressed in the 
various forms: 

(11-711) 
(H-712) 
^(E^. (11-713) 

Let us consider in particular the cosmical R.C. matrix which gives a metric 
of the de Sitter form. By (11-533) this isj 

(B^^^E^F^B^^C.D^^C^}. (11-721) 
This is the space tensor; the corresponding strain tensor is 

(ntraln = 4tf~ 2 Vl6=^ (H'722) 

as in (10-54). By 10-5 this represents an "uncertain" momentum in the 
CD-frame, or an "exact" momei^tum in the JEJjF-frame. 

Consider first the GjD-frame. By (11-712) the R.C. tensor B^ vo , which 
constitutes the complete energy tensor, has been constructed in the CD- 
frame; and it would therefore seem that the uncertain energy tensor is the 
true cosmical energy tensor. But in that case the position tensor is exact; 

t Or we may use wave functions which represent the kinetic energy and pressure of the 
particles of a gas, but not their rest energy; in that case the curvature of the space-time in 
which these functions are represented corresponds to the rest energy, but not the kinetic 
energy. 

% The suffixes k, e, s, t refer to kinematical, electrical, space-like and time-like matrices. 



204 Physical Applications [ ' 1 1 7 

that is to say the cosmical R.C. tensor is wholly concentrated at one point 
of space. This, of course, does not correspond to the actual conditions of de 
Sitter space-time. The failure is easily explained; we cannot do more than 
we have done with one double frame. The cosmical R.C. tensor is made up 
of contributions from some 10 79 unspecified particles, each of which has its 
own double frame. If we attribute to them uncertain momenta, they will 
have exact positions; but between them they will cover all space, giving 
a practically continuous R.C. tensor. 

Since (-B) Btraln is algebraic in the J^JP-frame, we can write, instead of 
(11-722), 

(*)strain=~^-M4^^ 

(11-73) 

introducing N double frames instead of one. If the frames commute (Case (b) 
in 2-7) they have no relation of orientation to one another. The method 
of imposing a uniform distribution of orientation belongs to the type of 
treatment of the many particle problem introduced in Chapter xvi. Here it 
is sufficient to recognise that the difficulty of avoiding concentration of the 
R.C. tensor at one point will disappear when we take account of its highly 
composite nature. 

But we have a great deal more to learn from the double frame, before 
proceeding to the many particle problem. We consider the simplest of all 
problems an elementary object particle in de Sitter space-time. The un- 
specified particles, which provide the de Sitter metric, form the comparison 
fluid for the particle. To bring the problem within range of treatment by 
double wave functions, we use a partial comparison fluid consisting of one 
unspecified particle. The object particle, being specified microscopically, 
does not contribute to the curvature of the space-time in which we represent 
its wave function. The cosmical R.C. tensor at any point corresponds to a 
uniform distribution of momentum vectors in all directions. We could 
suppose that this was a superposition of exact momentum vectors of a 
great number of unspecified particles; but it is evidently more convenient 
to take it to be the entirely uncertain momentum of one unspecified 
particle a partial comparison fluid. This will have definite position in the 
small region we are considering; but the other unspecified particles will 
provide similar R.C. tensors at other points. (We take advantage of our 
liberty to choose any convenient way of analysing the unspecified matter of 
the universe into "particles", i.e. wave systems.) The cosmical R.C. tensor 
at the point is the complete self-energy tensor of the partial comparison fluid. 
We have represented it in a CD-frame. By transforming to the J57JP-frame 
(so that the R.C. tensor is transformed into its dual 6 /ACva ) we change it 
into an exact energy tensor 6^45^45. We then readily recognise it as the 



11-8] The Riemann-Christoffel Tensor 205 

outer square of the stream vector of a neutral particle which was identified 
with the self energy tensor in 10-5. The unspecified particle is neutral, 
for otherwise it would not give the R.C. tensor characteristic of neutral 
space-time; also when the "entire uncertainty" applies to all components 
of a complete stream vector the sign of the charge necessarily shares in the 
uncertainty. 

Turning to the elementary object particle, since we are considering only 
a point, or small region, we must for consistency suppose it to have exact 
position and uncertain momentum. This condition is not easy to formulate 
since we do not want the charge and spin to be uncertain. It is simplest to 
start with an elementary particle and its partial comparison fluid defined as 
having exact momentum in the UjF-frame. We can afterwards transpose 
the results to the CD-frame. The product of the stream vectors of the object 
particle and of the partial comparison fluid will give the mutual energy 
tensor. This when transposed to the 6 y D-frame will give the R.C. matrix of 
the object particle. The portion B iav ^ + B llXM will give the ordinary energy 
tensor of the object particle. This as we have seen is the observable relative 
energy. Thus the relative energy defined by T^ v is the same as the mutual 
energy defined in 1O5 as the product of the stream vectors; except that 
the latter is expressed as a complete tensor and exhibits the kinematical 
and electrical energies separately, whereas T^ v gives their sum. We can 
separate the part corresponding to the self energy of the comparison fluid, 
as in (11-613), the remainder being regarded as an energy of the object 
particle.t For consistency this should agree with the self energy of the 
object particle defined directly in (10-51). This condition is investigated 
in 12-6. 

1 1-8. Neutral Comparison Fluid. 

By working in the J5? JF-frame we are able to use particles with exact stream 
vectors. Consider an object particle with stream vector C7 = SJK /i % fi = ^x* 
and a comparison particle with stream vector F = SjK ft v ft = ^eo*. The com- 
bined system is described by double wave vectors Y, X, where 

T = ^*, X = xo>*. (11-81) 

The microscopic treatment will depend on the use of these wave functions ; 
but the connection with macroscopic theory is given by the identification 

t Since the object particle and comparison fluid move in opposite directions relative to 
the frame in our coordinates (space and imaginary time), the self energy of the partial com- 
parison fluid should be taken to be minus the square of the stream vector in order that it 
may be comparable with a mutual energy. An unspecified particle treated as object particle 
would produce a recoil of a similar unspecified particle treated as its partial comparison fluid. 
This explains why the total energy in (11-613) is the difference, not the sum, of the energies 
of the object particle and comparison fluid. 



206 Physical Applications [ 1 1- 8 

(comparing (11-711) and (11-713)) 



where 6 /iv(r is the dual R.C. tensor of the object particle with the other 
particle as comparison fluid, together with the condition that the dual 
cosmical R.C. tensor b jMtr f which determines the metric is given by 

V V " = *fitwr'- (11-822) 

We have been mainly using space tensors, because we wished to connect our 
results with the space tensors of general relativity theory; but for the 
special purposes of wave mechanics strain vectors are more useful. We shall 
therefore take $, x> #> <*> * be covariant wave vectors forming a strain 
tensor 6 jltW . 

In this simple formulation we have taken the comparison particle (or 
partial comparison fluid) to be an elementary charged particle. Our standard 
equations, however, postulate a neutral comparison fluid. We wish now to 
direct special attention to the step by which we pass from a charged particle 
to a neutral particle as comparison fluid. The relation between the stream 
strain vectors F, V$ of an elementary and a neutral particle is 

TJ = qsF (11-83) 

This follows at once from (6-64), the strain vector of the neutral particle 
(equally likely to be in any one of the four states) being J (8 a + 8 b + S C + S d ). 
We can therefore pass from an electrically saturated to a neutral com- 
parison fluid by contracting the suffixes of ^ and w. The change is expressed 
by the transformation . , , , , m j.\ 

7 f OL < rBXv* lt *8'~*'iTOtYBXv***B (JLJi'o*) 

or, in terms of wave functions, 

Y^X^JY^Xrf. (11-85) 

In (11-85) we need no longer assume that Y and X are factorisable. A state 
of a double system is regarded as elementary if it is pure as regards factorisa- 
tion into Y and X, irrespective of whether there is a UV factorisation. The 
formula (11-85) is of great importance in Chapter xn. 

We have at last reached the solution of what might be called the initial 
problem of electron and proton theory. The microscopic complete relative 
(or mutual) energy strain tensor of an elementary charged particle together 
with a neutral comparison fluid has the form Y a 0X y . As the importance of 
this may not be obvious, we make the following comments: 

(1) The elementary charged particle is usually described by simple wave 
functions ^, #, and the energy and momentum are contained in a simple 
stream vector 0x*- By introducing an empirical mass constant (different 
for the proton and electron) these simple wave functions are made to serve 
the ordinary purposes of quantum theory and yield correct results. But if 



11-8] The Riemann-Christoffel Tensor 207 

we are not content to accept the mass as an empirical characteristic peculiar 
to wave mechanics, and wish to trace its connection with the macroscopic 
mass identified with a component of space-time curvature, we are met with 
the difficulty that all the measurable characteristics of a charged particle 
are contained in double wave functions of the particle and a physical com- 
parison system. The conditions of replacement of the double wave functions 
by single wave functions is the central problem in the unification of rela- 
tivity theory and current quantum theory. We take a large step towards 
the solution when we find that, adopting a neutral comparison fluid, the 
double wave functions are of the specialised kind Y a p, X y p, the second 
suffixes being "contractible". 

(2) Unlike $ and #, the wave functions T, X are directly connected with 
the fundamental tensors of relativity theory. Taking the energy tensors of 
a number of particles (such that the components ignored in macroscopic 
theory cancel out) we can calculate the curvature of space-time equivalent 
to them, and hence translate the energy, etc., into macroscopic units. 

(3) The replacement of the energy tensor TX* by an energy vector ^x* 
occurs by contraction of the suffixes j8, 8, which is the usual method in tensor 
calculus of reducing the rank of a tensor. 

(4) We have taken the comparison fluid to consist of one neutral particle, 
that being all that we can provide for with one double frame. It is therefore 
a partial comparison fluid according to our previous terminology. The 
passage to the actual comparison fluid consisting of N unspecified particles 
is, however, quite simple. The analysis in (11-73) represents the energy 
tensor as the sum of independent tensors in different frames; so that each 
object particle affects only its own partial comparison fluid. The average 
recoil of the total comparison fluid is therefore 1 /Nth of the amount of the 
recoil of the partial comparison fluid. The Riemann-Christoffel tensor is 
thereby lessened in the ratio 1/N. 

(5) The ideal conditions postulated in these equations correspond to 
de Sitter space-time containing no macroscopically specified matter. It 
should be remembered that de Sitter space-time is, according to ordinary 
standards, an expanding universe. It will be necessary later to treat the 
modifications involved in using a static frame of reference (Einstein space- 
time) but this involves introducing macroscopically specified matter, 

We shall now show that the contractible double wave vectors Y a 0, X y p 
can (for practical purposes) be replaced by simple wave vectors a , x y The 
observational significance of wave functions is that they determine expecta- 
tion values for the system of operators P which represent observable 
relations. Ordinarily for double wave vectors (appropriately normalised) 
the expectation value is X a gP ayt pT y 8; but in the present application the 



208 Physical Applications [11-8 

contraction of the second suffixes restricts P to the form P ay , and the 
expectation value is X P *F (11'86) 

Introduce eight simple wave vectors 0#>, rf (j8= 1, 2, 3, 4) defined by 

W-Ytf. x.W-X.0. 
Then (11-86) becomes 

X. W ^y* y n) + X^ ( ll ' 87 > 

This is the expectation value of P with respect to an impure simple wave 
tensor 22) 33 4). ( 1 1-88) 



By using symbolic factors of T the expectation value may be written 
FPTasin (3-35). 

Thus, for the purpose of calculating expectation values, the pure (con- 
tractible) double tensor *FX* can be replaced by the impure simple tensor T. 
The reduction of double wave tensors to simple wave tensors is therefore 
legitimate, but involves a re-analysis into pure elementary states. 

We have considered the reduction of an isolated (discrete) wave tensor. 
There still remains the question of replacing the phase space of the double 
wave tensor by that of a simple wave tensor. We shall see in the 'next 
chapter that this transformation of the phase space has important con- 
sequences. 

11-9. Retrospect. 

In this chapter a gulf between macroscopic relativity theory arid micro- 
scopic quantum theory has been bridged. We have found how to construct 
a Riemann-Christoffel matrix, (a) out of the tensors employed in general 
relativity theory, and (b) out of the wave functions employed in quantum 
theory. The mechanical quantities, energy, momentum, etc. referred to in 
the two theories are thus reduced to a common form. We can, for example, 
add the R.C. matrices corresponding to the wave functions of a large 
number of particles and derive the energy tensor which describes the 
aggregation macroscopically. 

Whereas the construction of the R.C. matrix out of macroscopic tensors 
is straightforward, the theory of its construction out of wave functions is 
perhaps unexpectedly intricate. It is natural to put the question, What 
is the R.C. matrix corresponding to a proton or electron with an exact 
momentum vector? The question is not so elementary as it seems, because 
the properties ascribed to the particle are really combined properties of 
the particle and a standard comparison fluid; and although the particle 
is elementary, the standard comparison fluid is not. If we consider 
instead an object particle referred to a single comparison particle, both 
having exact momentum vectors, the answer is simple. The R.C. matrix 



11-9] The Riemann-Christoffel Tensor 209 

is the outer product of their stream vectors. Already in Chapter x, the 
portion of this R.C. matrix which specifies linkage of translations, viz. the 
kinematical energy tensor, has been called the "mutual energy tensor" 
of the two particles ; but this was merely a development of the nomenclature 
of current quantum theory. The connection with energy as classically, 
i.e. macroscopically, defined appears for the first time in the present Chapter. 

The standard comparison fluid has an isotropy in space and time which 
can be defined by the condition that its energy tensor (referred to the 
frame) is invariant for rotations and Lorentz transformations, or equi- 
valently by the condition that its energy tensorf is to be identified with 
the metrical tensor g^. To pass from an elementary comparison particle 
to this standard comparison fluid, it is not sufficient to integrate over 
a symmetrical distribution of stream vectors of the comparison particle. 
The symmetry is disturbed by the recoil due to the motion of the object 
particle (specified by its exact stream vector). To neglect this recoil would 
be to neglect the very thing we are investigating, namely the mechanical 
specification of the object particle ; for in the idealised universe consisting 
of an object particle and comparison fluid, the only manifestation of the 
mass of the object particle is in its mechanical reaction on the com- 
parison fluid. 

In treating the construction of the R.C. matrix, and the connection of 
mass and momentum in quantum theory with the corresponding macro- 
scopic tensors, it is essential to bear in mind that displacement of the 
object particle disturbs the comparison fluid, the combination of action 
and reaction being expressed by double wave functions. But in applying 
this theory to the elementary wave functions of quantum theory, the 
complication arises that these are adapted to a different point of view. 
These simple wave functions are intended to describe self-contained 
systems superposed on an undisturbed environment. The conception of 
such detached systems is essentially non-relativistic, but it can be justified 
up to a point as a practical procedure. It is evidently necessary that the 
energy and momentum associated with the simple wave functions should 
include, not only that which properly belongs to the object particle, 
but also the recoil energy and momentum communicated to the comparison 
fluid. We have therefore to separate out from the whole R.C. matrix the 
constant portion which corresponds to the undisturbed comparison fluid 
(and provides a constant metric) ; the remainder is then regarded as the 
self energy of entities which have been added to the comparison fluid 
without disturbing it. The actual calculation of this self energy, for an 
elementary particle added to the standard comparison fluid, will be treated 
in Chapter xn. 

| More precisely, its kinematical self energy tensor. 

ETP 14 



210 Physical Applications [1 1-0 

A much easier problem is that of an object particle with an entirely 
uncertain momentum vector, whose recoil accordingly does not disturb 
the symmetry of the comparison fluid. Complete uncertainty involves 
uncertainty of charge and spin, so that the particle is necessarily neutral. 
To calculate the corresponding E.G. matrix we project the uncertain 
stream vectors of the object particle and comparison fluid from the 
EF frame to the CD frame. In the latter frame the stream vectors 
become exact; we therefore form their outer product, and project back 
into the EF frame. The resulting R.C. matrix is of the form (11-721), and 
gives a R.C. tensor which represents de Sitter space-time. 

We can regard the standard comparison fluid as composed of N neutral 
particles with uncertain momentum vectors, and with exact position 
vectors uniformly distributed in space and time.f Generally in treating 
a single object particle we use only one of these particles as comparison 
fluid a partial comparison fluid. In any problem relating to the actual 
universe the N particles of the comparison fluid must be present as object 
particles, although only a few of them are mentioned explicitly; the 
remainder are either unspecified or specified macroscopically. It is there- 
fore convenient to pair the particles, and treat each object particle as 
disturbing only its allotted comparison particle. The substitution of a 
partial for a total comparison fluid magnifies the recoil, and therefore the 
apparent mass, in the ratio N] this factor is absorbed into the ratio of 
the units used respectively in macroscopic theory and quantum theory, 
the elementary equations and definitions of the latter theory being, as it 
were, based on the assumption that the particle under consideration is 
the only particle in the universe. We shall find later that more precisely 
the factor is fJV; the modification is due to the exclusion effect of the 
particles of the comparison fluid on one another. 

When all the particles are unspecified we obtain the cosmical R.C. 
matrix. Extracting from it the macroscopic R.C. tensor, we find that it 
corresponds to de Sitter metric. The energy tensor T^ v vanishes; this 
is the relative tensor of the aggregation treated as object system referred 
to the same aggregation treated as comparison fluid. The relative energy 
tensor is closely related to the mutual energy tensor (which in this case 
becomes a self energy tensor) ; but in the mutual or self energy tensor 
the kinematical and electrical components are kept distinct, and the 
former constitute the energy components in the more limited sense. The 
kinematical self-energy B^ does not vanish, and, as already explained, 
it is identified with g^ v . 

To obtain any other than a de Sitter metric it is necessary to specify 

f This is for analytical convenience; the actual distribution in position is, of course, 
continuous. 



1 1-9] The Riemann-Christoffel Tensor 21 1 

some particles macroscopically. The only important alternative metric is 
that of an Einstein universe ; this will be treated in Chapter xiv. Irregular 
metrics are important in general relativity theory, but it seems un- 
profitable to combine them with microscopic problems, e.g. to investigate 
the behaviour of a hydrogen atom in an irreducible gravitational field. 
Practically the perturbation would be exceedingly minute; and if we 
retain such small quantities, it is presumably illegitimate to employ a 
macroscopic specification of the perturbing system. In any case the 
problem is concerned with a special kind of perturbation of the atom and 
it would be out of place to deal with it in general theory. For the 
purposes of quantum mechanics we are therefore limited to the two 
possible uniform metrics, viz. the de Sitter and Einstein universes. 

The prominence of the de Sitter metric in our investigations is due to 
the fact that, following Dirac, we have developed the theory from Lorentz- 
invariant equations. Lorentz transformations are not applicable to an 
Einstein universe. So long as we treat free particles, with exact momentum 
vectors and therefore distributed uniformly throughout the universe, 
a Lorentz-invariant frame of de Sitter type is required. When we come 
to less abstract problems, and treat steady states, represented by internal 
wave functions to which Lorentz transformations are inapplicable, an 
Einstein metric becomes admissible; and it is to be preferred because 
(unlike the de Sitter metric) it forms a static reference system. Current 
quantum theory in its more abstract formulae presupposes a de Sitter 
background, and in its more practical formulae presupposes an Einstein 
background. Perhaps the most difficult part of the present investigation 
has been the sorting of these two influences. 



CHAPTER XII 
THE MASS-RATIO OF THE PROTON AND ELECTRON 

12* 1 . Contraction of a Volume Element. 

This chapter deals with a point which arises when the double strain vector, 

<S-Z7F-* B ^ Xr 8 , (12-111) 

of an elementary particle, with another elementary particle as comparison 
fluid, is contracted to form the simple strain vector, 



which represents the same particle with a neutral particle as comparison 
fluid (H-84).t 

We may remind ourselves how this process arises. From an observational 
point of view, it is meaningless to describe a particle without some physical 
reference object (11-1); and the observable characteristics which we 
measure are properties of the combined system of the particle and reference 
object. The wave functions containing observable characteristics, such as 
the mass of the particle, must therefore be double wave functions Y, X, of 
the particle and an idealised standard reference object which we call the 
comparison fluid. But in current quantum theory the mass and other 
characteristics of the particle are assumed to be contained in simple wave 
functions 0, x- How comes it that double wave functions Y, X can be 
replaced in practice by simple wave functions 0, #, apparently without 
much harm? The first part of the answer was reached in 11-8, when we 
found that, by adopting the probability distribution of a neutral particle as 
comparison fluid, the double wave vectors Y = a <0, X = Xy w S are con ~ 
tracted with respect to the suffixes j8 and 8 which refer to the comparison 
fluid; so that, for some purposes at least, we can ignore the second suffix 
and treat Y and X as simple wave vectors. In particular the double strain 
vector S is contracted to {/S}, and becomes the product (actual or symbolic) 
of two simple wave vectors ifi, #, together with a merely algebraic factor 
<f>pa>p. 

It is not our business to supply a complete defence of this substitution of 
simple wave vectors for double wave vectors. It is a substitution which 
current theory has inadvertently made; and we have to trace how its con- 
sequences appear in the formulae of current theory. For our own part we 
shall be continually going back to the double wave vectors to see what 
current theory has missed by this substitution. 

t The factorisation in (12-111) and (12-112) need only be symbolic. 



12-1] The Mass-ratio of the Proton and Electron 213 

The strain vector (12-111) specifies an elementary configuration of the 
combined system represented by a single point in its phase space. The 
actual state is specified by probability factors attached to a number of such 
configurations, or more generally by a continuous probability distribution 
over the phase space. In the latter case S is normalised so that Sda)jfl is 
(apart from a unitary factor containing the phases) the probability of a 
range of configurations dw ( 7-7). 

The question that we have to consider is, What happens to dw when we 
contract (12-111)? 

Our procedure is based on the law of multiplication of probabilities. The 
intention is that, leaving out the unitary phase factors, S=UV shall 
express the law that the probability of a configuration of the double system 
described by 8 is the product of the probabilities of the corresponding 
configurations of the simple systems described by C7 and V. But SUV 
does not express this unless volume elements are inserted on both sides. 
The formulae as they stand postulate that the different configurations 
are represented by discrete wave functions; they can only be extended to 
continuous wave functions if it is assumed (or arranged) that the volume 
elements take care of themselves. 

It will now be convenient to change the notation. Let /S, $', 8 Q be the 
strain vectors of the object system, comparison fluid and combined systemjf 
and let dw, da>', da> Q be volume elements in their respective phase spaces. 
To neutralise the comparison fluid we add together four elements da> Q which 
yield the required balance of charge and spin; and we equate the probability 
contained in these four elements to the product of the probabilities in the 
corresponding elements of the phase spaces of 8 and S'. Denoting the 
summation of four elements by { }, this gives (disregarding unitary factors) 

{S }dW"o= (Sda>l&) (S'dw'IV). (12-12) 

In order that this may reduce to {$ } = 88' as currently assumed, we must 
have dw Q =kda>dw' 9 (12-13) 

where k is the constant i2 /i22'. 

The point at which the assumption {$ } = 88' enters fundamentally into 
current theory is in the dynamical equations (8-4). As explained in 9-1, 
the significance of these is that they give "steady states" distinguished by 
the constancy of some observable characteristic. Primarily therefore the 
dynamical equations apply to the double space vectors T of a particle and 
comparison fluid, since the particle alone has no observable characteristics. 
But in practice we substitute the simple space vectors T, T' which express 
(unobservable) relations of the two systems to a geometrical frame. Whether 
these will satisfy the equations, i.e. whether T and T' will be parallelly 

f For brevity, the system whose strain vector is 8 will be called "the system S". 



214 Physical Applications 

displaced when the double vector T containing the recognisable character- 
istic is parallelly displaced, depends on their relation to T Q . If T = TT' 9 
parallel displacement of T and T r involves parallel displacement of T Q . 
But this is not in general true if T do> = TT'da>da>', since volume elements 
are not transformed by parallel displacement in the same way as space 
vectors. The dynamical equations currently adopted therefore impose an 
additional condition TT' = T , or in the present application TT' = {T }. The 
corresponding strain vectors therefore satisfy SS' = {S }. 

Equation (12- 13) results from a comparison of two conditions, 

{S Q }da> = kSdto . S'da>', {S Q } = SS'. (12-14) 

The first is imposed by the law of multiplication of probabilities, and the 
second by the dynamical equations. 

We can now define more narrowly the circumstances in which (12*13) has 
to be satisfied. Since it represents an assumption in the dynamical equations, 
it applies to displacement in the dynamical coordinates. Further, since the 
dynamical equations are differential equations of the second order (9-2), 
we have to retain squares of the coordinates in the investigation. Local 
orthogonal coordinates in phase space are insufficient; and it will be neces- 
sary to use a more extended system such as stereographic coordinates. 

Before applying (12-13) we must set forth certain general considerations 
which arise in combining two systems. For purposes of exposition it is more 
convenient to consider two elementary particles S 9 S' ; but S' will ultimately 
be replaced by a neutral particle. 

12-2. Combination of Two Systems. 

As an approach to the more general theory, let us first consider the com- 
bination of two particles S, S' into a single system S according to the 
elementary wave mechanics of de Broglie and Schrodinger, in which each 
particle has only one phase variable. Let 0, 0' be the phase variables, so that 
the wave functions of S and S' are 

<A = e**V , 0' = e*^ '. (12-211) 

These are combined by multiplication, so that, setting *F = ^r ^r ', the wave 
function of the double system is 

T = e**< '+*'>%. (12-212) 
As in 7-4, we introduce linear variables by setting 

ds = Rde, d8 f = R'd8'. (12-213) 

Then, writing m 1/21?, m' = 1/25', (12-214) 

we have T = e******^ . (12-215) 

In this form the phase angle ms + m's' can be interpreted as action, the 
action of a system being Jracfo. 



12-2] The Mass-ratio of the Proton and Electron 215 

We have described Y as the wave function of a double system. That is not 
quite the same thing as the wave function of a combined system. To obtain 
the wave function of the combined system, the time coordinates t, t' of the 
two particles must be equated. Consider, for example, the sun-earth system. 
That does not comprise a combination of the earth today with the sun a 
week ago; no reference to the "orbit" of such a combination will be found in 
astronomical textbooks. The essence of the process of "combining" is the 
substitution of a single time coordinate for the whole system instead of 
independent time coordinates for its separate parts ( 7-8). 

We are not arguing that there would be anything illegitimate in the con- 
ception of 8 and 8' as a double system, i.e. with two independent time 
variables. Our point is that there are two possible conceptions whicli must 
be carefully distinguished, and that the usual description of a composite 
system refers to the conception of it as a combined system, i.e. with a single 
time variable. When we speak of the orbit of a planet or the quantum state 
of an atom we are referring to the combined system, not the double system. 

When s and s' are expressed as functions of the coordinates, the wave 
function T of the double system is a function of eight coordinates x, y, z 9 1, 
a?', y', z', t'. The wave function of the combined system S Q is (*F) r _,, a function 
of seven variables. The mass or energy of S is the value of the operator 
-id fit, which by (12-215) is 

mdsldt + m'(d8'ldt') t , H . 

This is the sum of the masses of S and S' at a simultaneous instant, the 
factors ds/dt, ds'/dt' being the FitzGerald factors representing change of mass 
with velocity. 

In the general theory the simple algebraic phases 0, 0' are replaced by 
space-like matrices with ten components (7-3) which, for a range of con- 
figurations small enough to be referred to local orthogonal coordinates, may 

be denoted by = SJEf0=SJ5?a;/-R, 1 

s n n^ t p p/ I (12-221) 

and the wave functions (within the above small range) are 

0=e**0 , 0' = e**'0 '. (12-222) 

For the wave function representing a combined system S Q we have similarly 

T=c**T , (12-231) 

where t^^^E^O^^^E^x^B^. (12-232) 

In forming a combined system out of S and S' we equate their times t and 
t', and this common time is also the time t of the combined system S . 
Since the time in phase space is represented by the algebraic coordinate, the 
identification t = t' = J becomes, in the above notation, 

(12-24) 



216 Physical Applications [12-2 

We notice that, whereas in Schrodinger's theory the algebraic phase 
variable represents s, in the general theory it represents t. The difference is 
not surprising, since in the former theory one variable has to do the work of 
ten. In the matrix theory the strains of the system corresponding to dis- 
placements along x, y, z are provided for more precisely by separate non- 
algebraic phase variables, and therefore no longer enter into the algebraic 
phase. A possible misunderstanding may be caused by the fact that in matrix 
theory just as in Schrodinger's theory the wave function for plane waves is 
i// = e (m8 *// Q ; but here ims is the eigenvalue of a matrix expression representing 
the phases (i/r having been specially chosen as an eigensymbol), and is not 
to be identified with the algebraic phase Ji0 16 . 

It may be well to state again the reason why the time is represented by 
the algebraic coordinate in phase space. Progress in time corresponds to 
rotation (about the centre of curvature of space-time) in the plane associated 
with jB 45 . Hence energy conjugate to the time is the 45 component of the 
complete space vector which comprises the mechanical properties of the 
particle. The space vector is multiplied by iE^ to form the associated strain 
vector; energy is therefore the E IB component of the strain vector. The trans- 
formation g = ei fcl iA applied to the strain vectors in phase space, which 
causes displacement in the algebraic coordinate, is therefore a displacement 
conjugate to the energy, i.e. displacement in time. 

The phase space of $ contains 136 coordinates v . It may be suggested 
that since the configurations of S and S' are each completely specified by 
ten coordinates, one of which they have in common, we shall require a phase 
space of 19 dimensions at most to specify the combined system $ . But that 
is to begin at the wrong end of the problem. Our observational data relate 
to complex systems. We do not construct S Q to represent data which have 
been ascertained about S and S'', we construct 8 and S' to represent data 
which have been ascertained about $ . We have to take a system repre- 
sented by a double wave vector in any combination of its 136 phases, all 
combinations being relativistically equivalent, and determine the conditions 
under which it can be dissected into two systems each with ten phases, 
or into a ten-phase system and a neutral comparison fluid with a single 
phase. 

Initially the probability of $ is distributed uniformly over its 136- 
dimensional phase space. We replace it by two initial probability distribu- 
tions uniform over the ten-dimensional phase spaces of S and S'. A great 
variety of configurations regarded as potentially distinguishable in the first 
representation must be classed as indistinguishable in the second representa- 
tion. That is to say, there is an alteration of the basis of statistics. It will be 
found that this alteration has important consequences. 



12-3] The Mass-ratio of the Proton and Electron 217 

12*3. The Augmented Phase Space. 

In combining 8 and S' we drop a time coordinate. Accordingly before 
dissecting $ we must insert an additional time coordinate and treat S as 
having an infinitesimal uniform extension in this extra time. Let 



so that dt Q dt f =dtdt'. (12-32) 

Reducing to angular measure with the respective scale constants of the 
three phase spaces, (12*32) becomes 

R Q *dO Q dO r =RR'dO lB dd lB '. (12-33) 

By this augmentation the phase space of S Q contains two algebraic 
phases ( = 16 16 ) and r . Owing to their commutative property we can 
treat the probability distribution in the algebraic phases independently of 
the distribution in the other phases (cf. 7-7). For example, in ten-dimen- 
sional phase space the volume element in local orthogonal coordinates 
daj = d0 1 d0 2 . . . contains an amount of probability do>/2, where Q is the whole 
volume of phase space which we have shown to be finite ( 7-4). Writing daj c 
and l c for the corresponding volumes without the algebraic dimension, we 

haVe dco = d0 16 do> c , Q = 27rii c , (12-34) 

and the probability dw/Q is the product of the independent probabilities 
d0 16 /27T and do> e /!i c that 16 is in the range d0 16 and that the other coordinates 
are in the range da) c . 

Hence the initial probability associated with the range d0 16 d0 16 ' is 
(d0 16 /27r) (dO lB '/2Tr) and the initial probability associated with the range 
d0 d0 r is (d0 /27r) (dO r /27r). These must be equal, since rf0 16 d0 16 ' and d0 d0 r 
represent the same range described in different ways. Hence by (12-33) 

R*=RR'. (12-35) 

The new variable r must, like the other phases, correspond to a circular 
(not hyperbolic) transformation, since it is essential that phase space shall 
be closed (7-3). We cannot begin the process of dissection of S unless this 
condition is satisfied. It is, however, desirable to examine an argument 
which seems to suggest that t r should be represented by a hyperbolic 
transformation. 

The definition r = r /JR refers to local orthogonal coordinates, and for 
extended coordinate systems the relation of t r and r becomes non-linear. 
Adopting stereographic coordinates and considering variations of t r only, 
we have, by (7-58), 



so that r = 2tan- 1 ( r /2JB ). Thus, as r increases, t r increases without limit, 



218 Physical Applications [12-3 

provided that O r is real. If, however, r is an imaginary quantity iu r + we 
replace t r by it r and obtain 



so that w r ==2tanh~ 1 (^/2JZ ). Then as w r ->oo, r ->2JR . Accordingly -' is 
not greater than 2R . It suggests itself that 2R should be identified with the 
distance between S and 8', and therefore with the light-time (the velocity of 
light being unity). The result t t' > 2R then means that two systems cannot 
be combined at instants such that one is in the absolute future of the other. 
The variable u r is found to be important in another problem which arises 
in the combination of two particles ( 15-5), but here it is irrelevant. The 
radius R of phase space is a scale constant applicable to all configurations 
represented in phase space. If we equate 2JR to the distance between the 
particles, the phase space can contain only those configurations for which the 
distance is a fixed constant. That is not the problem here treated. 

12-4. The Fundamental Quadratic Equation. 

Writing da) = V g.dr as in (7*422), the relation (12-13) becomes 

(12-41) 



The formula now applies to any system of coordinates. 

We adopt stereographic coordinates. It has been shown in 7-5 that when 
a transformation (not necessarily infinitesimal, but not involving anti- 
perpendicular components) is applied, each point of phase space in the 
infinitesimal neighbourhood of the origin receives the same increment of its 
stereographic coordinates. That is to say, when a volume element is dis- 
placed in phase space dr remains constant. Hence for such displacements 
in the three phase spaces we must have 

V^ = CV^.V^V, (12-42) 

where C is a constant. By (7-59) 



where r 2 is the square of the length of the displacement, and n is the number 
of dimensions of the phase space. Inserting this in (12*42), we have 

(1 + r 2 /4# 2 )-o = (i + r 2 /4# 2 )-* (1 + r' 2 ^' 2 )-"' (12-43) 
together with JB - W = CR" n R'~ n '. 

Consider a displacement in time, so that r, r', r are # 16 , # 16 ', # 16>16 , and 
are all equal by (12*24). Then, expanding (12*43) and equating coefficients 
of r 2 , we have n /Itf=n/IP+n'/Il'*. (12*44) 

By (12*35) and (12*214), 1Z 2 = BR', and R/R' = m'/m. Hence 

'*=0. (12-45) 



12-5] The Mass-ratio of the Proton and Electron 219 

For an elementary particle and a neutral comparison fluid, the dimensions 

of the phase spaces are 

'!. (12-46) 



(The strain vector of a neutral particle, being algebraic, has only one phase 
variable.) Hence we obtain the fundamental quadratic equation 

10w 2 -136ww' + m' 2 = 0. (12-47) 

Its two solutions give two possible masses m for an elementary charged 
particle in terms of the mass m' of a neutral particle. These accordingly will 
be the masses of the proton and electron. The ratio of the two roots is 1847-6. 
This is the ratio of the masses of the proton and electron, when mass or 
energy is defined as in quantum theory by the operator ( - iA/2?r) 3/3*, or 
equivalently by E = hv. We shall find in 15-9, that this deviates somewhat 
from the classical definition of mass and energy; and that, adopting the 
classical definition, the theoretical mass-ratio is 1834-1. Since all our 
formulae imply the quantum definition, we must adhere to the value 
1847-6 in the developments which follow. 

12*5. Notes on the Solution. 

One or two steps in the foregoing proof require fuller discussion. 

Should we take M O = 136 (original phase space) or 137 (augmented phase 
space)? Equation (12-42) was obtained by considering parallel displace- 
ment of a volume element dr conformably with the dynamical equations. 
The angular element dO r is not a range dynamically transferred from the 
origin by parallel displacement; it is a constant infinitesimal thickness 
assigned to phase space, and hence equal for all volume elements considered. 
The value w = 136 is therefore correct. 

Is it legitimate to satisfy (12-43) as far as r 2 , leaving a discrepancy in the 
higher powers of r? Equation (12-43) is not valid beyond r 2 ; its exact form 



1 ', (12-51) 

where J , t, t' are corresponding times in the three systems measured in 
stereographic coordinates. The times are equal in some system of reckoning, 
but there is no reason to suppose that that reckoning is stereographic. In 
fact for large values of t the stereographic reckoning becomes absurd. The 
substitution t t t' =r is only valid to the first order of small quantities, 
and therefore (12-43) is only valid to the second order. To the first order 
(corresponding to natural coordinates) the reckoning of t is fixed by the 
scale constant R, and there is no ambiguity as to what is meant by the 
three systems being simultaneous. In short, having found m/m' from the 
terms in t 2 , the higher powers merely tell us how we must define simultaneity 
of the three systems for large values of t. 



220 Physical Applications [12-5 

By obtaining a formula correct as far as r 2 , we obtain exact values of the 
second derivatives at the origin; so that we achieve our aim of providing for 
the second order differential equations which express the laws of physics. 
Results obtained in a special coordinate system, such as stereographic 
coordinates, are of no great interest in themselves. We use them to extract 
the invariants, in particular the curvature invariants which correspond to 
mass and density. For this purpose it is only necessary to expand as far as r 2 . 

We come to a more difficult point. The displacement which we have con- 
sidered is along the algebraic coordinate; but the algebraic coordinate is 
very much aloof from the others; and although we have used the same stereo- 
graphic projection for it as for the others, there is no obvious need to do 
so. It is not possible in an extended region to use natural measure for all 
coordinates; but it is possible to use natural measure for the algebraic co- 
ordinate and stereographic measure for the rest. Geometrically the distinc- 
tion is that the algebraic dimension has cylindrical curvature, whereas the 
others by their non-commutative relations determine a spherical curvature. 
Spherical curvature is an invariant; and on the assumption that E, R, R Q 
are radii of spherical curvature, the result (12-44) is independent of the 
special system of coordinates used in obtaining it. Unfortunately, in the 
equation as derived, they are radii of cylindrical curvature. To validate the 
proof we must show that there is some condition (not yet mentioned) which 
requires the algebraic coordinate to be represented uniformly with the 
others. 

The fact is that we have not explicitly introduced the condition that the 
particle whose mass we are seeking is a pure elementary particle, having 
therefore a singular strain vector. As shown in 7-9 such a particle will have, 
associated with it, a singular line in phase space given by d 8 = ( E l + J57 16 ) d<f>. 
This provides an absolute scale comparison of lengths in the A\ and the E 16 
directions just as in four-dimensional space-time a singular line (con- 
stituting a light track) provides an absolute scale comparison between 
intervals of space and time. 

To go back to the first principles of measurement magnitudes in different 
directions are compared on the basis that "equivalent" magnitudes (which 
can be transformed into one another by relativity rotations) are equal 
magnitudes. This enables us to compare magnitudes in perpendicular 
directions immediately, and magnitudes in antiperpendicular directions 
indirectly, using for the latter a direction perpendicular to both as inter- 
mediary. But this does not apply to the algebraic direction which has no 
direction perpendicular to it. The measurement of proper mass, proper 
energy, proper time, in terms of the standards used for measuring momenta 
and lengths in other directions, must depend on a different principle of com- 
parison. This is supplied by the formula (6-64) for the strain vector of a pure 



12-6] The Mass-ratio of the Proton and Electron 221 

particle. Its momenta in four antiperpendicular directions E l9 /? 23 , J5? 45 , j 16 
are equal. This enables us to compare the momenta in the E 1Q direction with 
the momenta in other directions. If, as suggested above, a different scale 
constant J? were employed for the J? 16 direction, a stream vector which is 
pure in angular measure would not be pure hi linear measure; and our whole 
treatment of the analysis of matter into pure wave functions representing 
elementary particles would have to be revised. 

We may put the result in another way. An elementary particle possesses 
spin. We can measure time by the spin coordinate of the particle as time 
is in fact measured by the spin coordinate of the earth. Thus x l9 x n can be 
substituted for # 16 , #i 6 ,i 6 in our previous deduction; and the difficulties 
arising from the peculiar character of the algebraic coordinate are then 
evaded.f This brings out the fact that the mass which we have determined 
is that of an elementary particle. A distinctive feature of the elementary 
particle is that it has four dynamical coordinates all of which measure the 
time; but two of them are not shown in phase space. 

12-6. Energy Invariants. 

The primary operation of wave mechanics is multiplication; and in the 
combination of two systems the product of the masses m^m^ is more funda- 
mental than the sum m^m^. This was illustrated in the transformation to 
relative coordinates, in which the original particles are replaced by external 
w i internal particles whose masses Ji/, p, satisfy M/Z = m 1 m 2 . 

We shall call m^m^ the mutual pressure invariant of the two particles. 
Correspondingly m x 2 is called a self-pressure invariant. 

Let us provisionally interpret the term " pressure " literally. In statistical 
mechanics the pressure is the energy associated with one degree of freedom. J 
If there is equipartition between n degrees of freedom, the energy is nm l m 2 . 
We shall call nmm 2 the mutual energy invariant. The self-energy invariant is 
defined similarly. 

The fundamental equation (12-45) 

n^mm' = nm 2 + n'm' 2 

expresses that the mutual energy invariant of the system $ is the sum of 
the self-energy invariants of the two parts composing it. We thus get a kind 
of physical picture of the significance of the fundamental quadratic. In 
particular the equation I 36mmo== io m 2 + m a (12-61) 

t The algebraic coordinate was employed because we knew what were fct corresponding" 
displacements of the three systems in that coordinate but not in any other coordinate. 
But the above argument shows that, besides (12-24), we have x l = x ll . 

J We have in mind waves in which the energy is half kinetic and half potential. If kinetic 
energy alone is present a factor is introduced. 

We shall in future denote the mass of the neutral comparison particle by m instead of 
m' since m' seems a rather unsuitable notation for an important constant of nature. 



222 Physical Applications [12-6 

means that the introduction of the elementary particle ra can be regarded 
either as replacing a neutral unspecified particle m by a double system with 
136-dimensional phase space or as adding a simple system with ten-dimen- 
sional phase space. Both interpretations give the same total energy in- 
variant. From this point of view the number of dimensions of phase space 
is involved, because in the steady state represented by the initial probability 
distribution, each dimension shares in the equipartition of energy. 

Let us extend this to a static system formed by two elementary particles 
with a neutral comparison fluid. We know that two protons or two electrons 
cannot form a static system; and we may therefore anticipate that the two 
particles will be a proton and electron. In short, our system is a hydrogen 
atom (or possibly a neutron). 

We resolve the hydrogen atom as in 10-9 into an external and internal 
particle, described by external and internal wave functions, with masses 
M = m p + m e , p. = m p m e /(m p + m e ). (12-62) 

Since m p , m e are the roots of 10m 2 - 136wm + w 2 = 0, we have 

A/ = (136/10) w , ^=(1/136),,. 
Writing these in the form 

136jiw =10*f 2 , 136^wi = m 2 , (12-63) 

we see that for an external particle the mutual energy invariant of the 
particle and its comparison fluid is equal to the self-energy invariant of the 
particle; but for the internal particle the mutual energy invariant is eq 
to the self-energy invariant of the comparison fluid. The two particles have 
therefore a very different type of relationship to the comparison fluid. 

We can extend the foregoing definition of pressure and energy invariants 
to conditions in which the energy is represented by a symbolic operator W. 
The self-energy invariant of a particle in phase space of n dimensions is 
then n W 2 . If the wave functions representing steady states satisfy Dirac's 
equation W$ = m[i 9 it follows that (JP/ra)^ = ra0. Thus we have two oper- 
ational forms of the energy w, namely the linear hamiltonian W and the 
second order hamiltonian W 2 /m. For the external particle 

(external particle), (12-641) 

where y e is the natural constant 136m , and n is the number of dimensions 
of the phase space of the external particle. For the internal particle 

136F 2 (internal particle), (12-642) 



where y i = y e /136 2 

and n is unity. Thus y i9 y e are the factors which convert energy invariants 
into energies on the ordinary scale. 



12-7] The Mass-ratio of the Proton and Electron 223 

For an electron or proton, the fundamental quadratic can be written as 



so that the operational form of the energy is 

- (nW 2 + 2 ) (elementary particle). (12-643) 

7e 

It is interesting to compare (12-643) and (12-641). 

I think that for theoretical purposes the energy invariant is to be pre- 
ferred to the energy; that is to say, we should replace the quadratic energy 
operator JF 2 /w by nW 2 /y. In the following chapters I have not used the 
energy invariants as much as I might have done, fearing that it would be 
a stumbling-block to the reader. When the form W 2 /m is used, it should 
be realised that m is a combination of the universal constant ra with a 
geometrical factor of the problem, the latter introducing the number of 
degrees of freedom. It is also particularly important to notice that m 
is not necessarily the rest energy; for internal wave functions (which 
are those most concerned in practical problems) the rest energy is zero 
(10-982). 

12*7. The Association of Mass and Charge. 

We have found that an elementary particle must have one of two masses 
m p , m e . We shall now show that in static conditions the particles of mass 
m p are of one sign (arbitrarily called positive), and the particles of mass 
m e are of the other sign; also that the positive and negative charges are 
equal. 

To test the sign of the charge we introduce a field of uniform electrostatic 
potential * 4 . If we can show that the effect of the field is to make the mass 
m* of an elementary particle at rest satisfy the modified equation 



0, (12-71) 

where a is a function of /c 4 , the required conclusion follows. For then 

m p * + m e *=^m Q = m p + m e , (12-72) 

so that we may write m p * = m p + c, m e * = m e , (12-73) 

showing that the part of the mass or energy due to the existence of the field 
is equal and opposite for the two types of particle. Our problem therefore 
reduces to showing that * 4 affects only the last term of the quadratic. 

By 8-8 the constant potential * 4 can be removed by applying the gauge 
transformation 0' = e ilf **4^ to wave vectors of index 1. The strain vector 
of index 2 generating phase space then undergoes the transformation 



224 Physical Applications [12-7 

g> = e 2tK.x 4 s = c -2ic 4 / # g a i r eady contains the time factor e* e * = e***; so that 
the time factor of S' is e ii>IR , where 



4 lZ) = (l-2ic JK). (12-74) 

Thus the field-disturbed environment of the object system is converted 
into the standard environment (neutral space-time) by antedating our 
description of it.f In analysing ( 12-4) the double wave functions, which 
contain the observable relations, into a simple wave function of the particle 
and a one-phase wave function of the comparison fluid, we gave to all three 
functions the same time t\ if, however, we give the one-phase wave function 
a time t', it will equally represent a neutral comparison fluid at time t' or 
the actual field-disturbed environment at time t. With this modification 
(12-43) becomes 

(1 + * 2 /4JR 2 )- w = (1 + * 2 /4jR 2 )-' 1 (1 +f i/4/Z' 1 )-*' (12-75) 

and gives the condition for analysis of the double wave functions, containing 
the observable relations of the particle to its field-disturbed environment 
at time t, into a simple wave function of the particle and a one-phase wave 
function of its field-disturbed environment at the same time t. The change 
only affects the last term of the quadratic equation, which becomes 

nra 2 -n ram' + (e7$) 2 m' 2 = 0. (12-76) 

This is the result we required. 

We do not apply the gauge transformation to the double wave function, 
because that contains the observable relations, and it is understood that 
the energy of the particle is to be determined from observation precisely as 
if the actual environment were the standard environment. As we should 
ordinarily say, the change from m to m* is an apparent change of mass, 
due to the difference between the comparison fluid actually present and 
available for our observations and the ideal comparison fluid referred to in 
the definition of m. 

The value of a in (12-71) is (l-2/c OJ R'); or, since ll'2R f = m 9 (formerly 
Call6dm/ >' a=l-K /m . (12-771) 

Using (12-73), we derive the relation 

-AWiM 2 . (12 . 772) 

>e) \ / ' 

Thus a is real for a physically real electrostatic potential, and (for electro- 
magnetic fields which are not too extravagantly great) is also real. By a 
happy accident, positive electric potential corresponds to positive * . 
The numerical value of a does not much concern us; but if any application 

t The reader may perhaps be inclined to object that, if (as usual) the comparison fluid 
is at rest, antedating makes no difference. But there is a phase angle which changes with 
the time; and we see above that it is this phase angle which exhibits the antedating. 



12-7] The Mass-ratio of the Proton and Electron 225 

is made of (12-771), we should notice that /c is not there measured in the 
same units as in the ordinary wave equation since it refers to a gauge 
transformation of the comparison fluid. 

It is interesting to consider the physical basis of the dissymmetry of 
positive and negative charges. Formally they are correlated to the two 
square roots of 1. This can only give rise to observable dissymmetry (such 
as a difference of mass) if, elsewhere in the theory, the two square roots have 
been allotted unsymmetrical roles in the description of phenomena. In the 
expression for a physically real interval E l x i + E z x 2 -{- E^x^ + E^it we have 
to select one of the square roots of 1, which we shall denote by iy, such 
that Etijt represents displacement towards the future when t is positive. 
Then any other square root of 1 occurring in our formulae is distinguished 
as if or ij . 

The irreversibility of time is manifested in three ways: (a) in consciousness, 
(6) in the laws of entropy, (c) in the cosmic expansion. Consciousness must 
be regarded as the ultimate source of the irreversibility, at any rate from the 
point of view of physical theory. Owing to the curious fact that our minds 
are acquainted by sensory mechanism with the past but not the future, 
observational knowledge has the form of an integral over past time up to 
the present moment t. Since probability is relative to knowledge, a formula- 
tion of the universe in terms of probability distributions is exceedingly 
unsymmetrical with respect to past and future time; and this irreversibility 
is shown in the manifestations (b) and (c). It is also shown in wave mechanics 
in a more elementary way by the concentrated wave packets, which are 
formed discontinuously by our observations and diffuse continuously as t 
increases towards the future. 

For the distinction of protons and electrons the most relevant mani- 
festation is the cosmic expansion. I do not mean the actual "expansion of 
the universe", which might perhaps have been reversed by altering the 
initial conditions. But a region of space with de Sitter metric necessarily 
expands relatively to our standards of measurement. We generally treat a 
small region of space isolated from the rest by artificial boundary conditions; 
but the dilemma, first pointed out by de Sitter, always appears either we 
must take it to be a portion of static spherical space (an Einstein universe), 
in which case Lorentz transformations are inapplicable; or we must take 
it to be a portion of de Sitter space-time, in which case geodesies that are 
initially parallel proceed to diverge. Since we have introduced space-time 
by Lorentz transformations, our method follows the latter alternative. 
Correspondingly we have an expanding comparison fluid, which is un- 
symmetrically related to past and future time.- In particular, elementary 
particles symmetrically related to the expanding comparison fluid are not 
symmetrically related to past and future time. 



226 Physical Applications [12-8 

12-8. The Stern-Gerlach Effect. 

We shall now calculate the energy of a hydrogen atom in an external 
macroscopic electromagnetic field. Representing the atom by external and 
internal wave functions, we have to calculate the effect of the field on these 
separately. An essential point in the investigation is that the external wave 
functions are continuous and the internal wave functions are discrete. 

We use the method of gauge transformations explained in 8-8. Let us 
introduce a field of electromagnetic potential by changing the local unit of 
length, so that the scale constants R, R r of the phase spaces of a particle 
and its comparison fluid become in the new measure j8~ 1 JJ, /J- 1 ./?', where j8 
is a unitary complex factor. The coefficient V g in the reduction from co- 
ordinate volume to natural volume will be changed to ]8 10 V g for the ten- 
dimensional phase space and j8V gr for the one-dimensional phase space. 
We do not recognise complex measures of volume; accordingly the factors 
are transferred to the strain vectors which become /J 10 $, pS'. The transfer 
means that the change of probability distribution due to the electro- 
magnetic field is incorporated in the modifying factor instead of in the 
initial probability distribution. 

The masses of the particles are such as to validate the usual assumption 
that the volume elements can be ignored in the dynamical equations; that 
is to say, 8 and S' can be treated as discrete strain vectors in the dynamical 
equations. But they do not behave as discrete wave vectors for gauge 
transformations, and consequently the electromagnetic terms in the 
dynamical equations are not the same. Let us now consider the gauge 
transformation for a discrete strain vector. 

A discrete wave function exists only for discrete values of its parameters; 
but it is a continuous function of coordinates, so that it is affected by the 
gauge transformation of the volume element of coordinates. By the theory 
of Chapter vm a discrete wave function has four dynamical coordinates; 
whence it follows that the transformation of the strain vector is S->/3*S. 
We obtain the same result more directly from the ordinary expression 

W+V^gdXtdxidXi (12-81) 

for the probability in an element dx : dx 2 dx 3 of three-dimensional space. 
Here V g is the four-dimensional factor, defined in ordinary tensor 
calculus, which varies as ]8 4 .t The factor ]8 4 is therefore attached to the 
discrete strain vector function ^*. 

We have therefore the following result. Although the external wave 

function is continuous, it is permissible to treat it as discrete for the ordinary 

purposes of wave mechanics, the mass M assigned to it having been expressly 

chosen so as to validate this practice. But if a gauge transformation is 

f Mathematical Theory of Relativity, equation (85-44). 



12-8] The Mass-ratio of the Proton and Electron 227 

applied, the strain vector will be changed in the ratio J3 10 whereas the 
change for a discrete wave function would have been in the ratio j8 4 . The 
wave functions $, <f> are therefore changed in the ratio j3 6 , as compared with j3 2 . 
Applying the theory of 8-8 with A = j8 2 , j8 5 , if the potentials in the wave 
equation for a discrete wave function are K^, those in the equation for the 
external wave function are f/c^. Thus for the same electromagnetic field 

we have g 

p = - i - + ic (discrete wave functions), (12-821) 

^'li 

JP = i h f K (continuous wave functions). (12-822) 
o*i> 

The hamiltonian of the external particle is therefore 

-O < = S*' (12-831) 



i ^ 
which gives the second order hamiltonian 



If the field consists of a magnetic force eH = 3/c 2 /3# 3 - 3K 3 /3# 2 * n the ^ 
direction, the last term becomes (f ie/^/3/) E^ . To the classical approxima- 
tion this term is twice the mutual energy of the particle and electromagnetic 
field. The mutual energy is thus 

* ^-l.jiJS^. (12-84) 

The factor UE 23 is the spin momentum of the external particle in the plane 
normal to H and has eigenvalues J. The factor f e/M, or in the usual units 
|/MC, is the effective magnetic moment per unit angular momentum. 
The corresponding result for the internal particle is 

-^.Z, (12-85) 

where the operator Z includes, besides the spin momentum, half the orbital 
momentum. f The factor f is now omitted because the internal wave func- 
tions are discrete. But it is necessary to consider whether they are the 
standard discrete wave functions in four dimensions referred to in (12-821), 
or whether the fact that internal space is three-dimensional will not give 
K ' = f * , and an energy o A JJ 

.~-Z. (12-86) 

I think that for the theory of the atom, so far as it is developed in Chapter 
ix, (12-86) is the correct result, the factor f being essentially the same as 
that which will occur later in (14-153). It is true that the internal wave 
function has the standard number of dynamical coordinates, and appears 

f The factor & is the well-known magnetic anomaly. 

15-2 



228 Physical Applications [12-8 

in 9-2, 9-3 to be a function of t as well as of the space coordinates. But 
although t is used in Chapter ix in order to agree with the usual notation, t is 
not the time; it will be shown in Chapter xv that it is an interchange co- 
ordinate. Now the interchange coordinate is by its nature gauge-invariant. 
Thus for internal space, V g varies as j3 3 instead of j8 4 , and the factor f is 
required. 

But it is well known that, when a magnetic field is applied to an atom, we 
have to distinguish between weak and strong fields. A strong field cannot be 
treated as a perturbation of the normal states, but involves a re-analysis into 
elementary states. The particle is, as it were, torn in its allegiance between 
the planes of simultaneity determined by the external momentum vector 
and those determined by H^ (with time direction so chosen that the field is 
purely magnetic). This effect of strong fields is generally described as an 
"uncoupling" of spins. The magnetically determined planes of simultaneity 
introduce a genuine time coordinate into the internal state additional to the 
proper time s correlated to the external momentum vector; the original t 
(interchange coordinate) is then relegated to the role of argument for small 
perturbations, and the states are re-analysed with respect to the genuine 
time coordinate. It appears therefore that (12-85) is right for strong fields; 
but it will be replaced by (12-86) in weak fields, where a gauge-invariant 
interchange coordinate takes the place of the genuine time coordinate. 

It is to be understood that in our equations H is not necessarily a measure 
of the field in absolute units; it is used to connect (12*84) alternatively (but 
not simultaneously) with (12-85) or (12-86). Actually in passing from (12-85) 
to (12-86) we should change the unit of energy in proportion to the number 
of degrees of freedom, so that they represent the same absolute energy. 

By suitable arrangements a stream of particles, projected in a strong 
inhomogeneous magnetic field, can be made to divide itself according to the 
different combinations of eigenvalues of (12-84) and (12-85). From the 
measured deflections the ratio of the factors fe/Jif c and e/fjLC can be determined. 
The generally accepted results give the numerical coefficient f , agreeing 
with our theory. Since M and //, are very nearly equal to m p and m e , the 
magnetic energies are generally attributed to the proton and electron, 
respectively. But it is clear from the foregoing investigation that they 
properly belong to the external and internal motions, respectively. 

The result for weak fields also appears to be confirmed by experiment 
(Babi, Kellogg and Zacharias, Physical Review, 46, 157 (1934)). In this case 
the ratio is fe/jfc : f e//uc, so that the numerical coefficient is \-. The 
experimental result is stated to be 3-25 0-3. 

Whilst the result for strong fields is a simple consequence of the theory, 
the theory for weak fields may perhaps require a closer scrutiny than I have 
been able to give. 



CHAPTER XIII 
STANDING WAVES 

13-1. Scalar Wave Functions . 

Vector wave functions have superseded the scalar wave functions of 
Schrodinger in the problems of atomic physics; but there are certain applica- 
tions of wave mechanics in which scalar wave functions are still needed. 
These will now occupy us for two chapters. We shall develop the theory of 
scalar wave functions independently, and afterwards show how the vector 
wave functions hitherto treated are connected to them. 

We have seen that a neutral particle (neutral both as regards charge and 
spin) has an algebraic strain vector ( 6-6); in this case the strain vector S 
becomes a strain scalar. Following the general procedure in wave mechanics, 
we represent it as the product of two scalar wave functions 0^. We may 
expect that scalar wave functions will suffice to represent any macroscopic 
distribution which is neutral as regards charge and spin in fact, such 
distributions as are fully specified by an energy tensor T^ v . If T^ is inade- 
quate and the distribution is of a type represented by a general Riemann- 
Christoffel matrix (11-4), vector wave functions will be required. The 
general theory of the representation of T^ v by scalar wave functions is given 
in 13-7. However, it is not of primary importance to ascertain the precise 
limits of the application of scalar wave functions; the main consideration is 
that they occur in certain problems of great importance in connecting 
relativity theory with quantum theory. 

In macroscopic theory, energy-density, momentum-density and pressure 
are components of an energy tensor T^p . It is part of the definition of these 
quantities that they satisfy the law of conservation (T*P)p = 0; and by this 
property T a p is identified with a geometrical tensor 

-87r/c2 T a/ , = C? a]5 ~j9f aj8 ((?-2A), (13-11) 

which satisfies the law of conservation identically. We insert the constant K 
(constant of gravitation) in order that we may be free to use the unit of 
mass, defined by jp ft = id/dx^, which simplifies the formulae of wave 
mechanics. 

Here A may be any constant, a change of A being a change of the zero from 
which energy and pressure are reckoned. But the zero condition, 2^ = 0, 
should not be taken to correspond to entire absence of matter and radiation 
(or in wave mechanics to zero probability of the presence of particles and 
photons). For if we adopt this as the zero condition everywhere, we are 
comparing the actual universe with a universe completely devoid of matter 



230 Physical Applications [13-1 

and radiation and therefore without observable properties. We cannot adopt 
as a reference standard for physical measurements a system in which all 
such measurements become indeterminate. We shall later consider the 
convention adopted for fixing A in current practice. 

To make a transition to wave mechanics, T^ must be interpreted as the 
expectation value of an operator T a . The only appropriate operation is 
covariant differentiation, and to satisfy tensor conditions we adopt 



where 8 stands for covariant differentiation, and m is an invariant. We shall 
be concerned only with scalar wave functions and natural coordinates; so 
that in our applications the covariant derivatives can be replaced by ordinary 
derivatives. 

No hypothesis is implied in (13-12). It defines the wave functions which 
will be used for the description of macroscopic systems, viz. they are to be 
such as to give at each point an expectation value of T a ^ which agrees with 
the tensor defined in (13-11). The information contained in T^p is thereby 
transferred to a pair of wave functions and made available for treatment by 
the methods of wave mechanics.f 

Equation (13-12) is consistent with the usual definitions of momentum 

p a =-i8/S# a , pv^rndx^ds. (13-131) 

For, substituting in (13-12), we have 

T a)3 = p j 8P a /m, (13-132) 

so that, when the momenta reduce to eigenvalues, 



*. (13-133) 

m da ds v ' 

which is a well-known form of the energy tensor of a system of particles 
when the right-hand side is summed in an appropriate invariant way.J 
Here, however, we regard (13-12) as the fundamental formula, not implying 
analysis into particles with eigenmomenta, but directly translating the 
curvature invariants of macroscopic theory into the operational forms of 
wave mechanics. 

The coefficient m should not be prematurely identified with the proper 
mass of the system represented by the wave functions. As it is of the dimen- 
sions of mass, we may call it the mass-constant of the system. In most 
applications the units are adjusted so that, for an elementary wave system, 
m is the proper mass of the corresponding particle. For a composite wave 

f In 13-7 we shall follow the inverse procedure. The wave functions are defined directly, 
and the operational form T a(3 required to represent T a(3 is deduced. 
J Mathematical Theory of Relativity, equation (53-1). 



13-1] Standing Waves 231 

system m can be interpreted as the proper mass of a single equivalent 
particle which is by no means the same thing as the sum of the masses of 
the particles. For the purposes of general theory the division by m is rather 
inappropriate, and it would be better to write (13-12) as 

wT aj5 = -S*/S&0&c a , (13-125) 

defining a pressure invariant operator mT^p (cf. 12-6). 
The expectation value is formed in the usual way, namely, 

Z!0 = fr^. (13-141) 

Thus tensor conditions are satisfied if <f> and \f> are invariants. For a small 
three-dimensional volume dV 

T^pdV^^T^dV. (13-142) 

If a component T u p reduces to an eigenvalue t^p, we have 

T^pdV^t^p.^dV. (13-143) 

This is conveniently regarded as expressing that the oc/J-component of the 
energy in dV is due to a probability </>^dV that an entity with energy com- 
ponent ^p is within the volume. We call this entity a scalar particle. It will 
be found later that an elementary scalar particle is equivalent to four 
elementary charged particles. 

If the wave functions of the distribution are analysed into a set of elemen- 
tary orthogonal wave functions, each of which is a discrete eigenfunction 
of certain components of T a p (in practice, the diagonal components) 
sufficient in number for the eigenfunction to be defined uniquely by their 
eigenvalues, we can introduce an exclusion principle. The exclusion prin- 
ciple states that in the whole domain of V there is not more than one scalar 
particle for each eigenfunction; so that the whole probability attached to 
any set of eigenvalues is not greater than 1. Using normalised eigen- 

functions $ ny (f> n which satisfy 

* 

(13-144) 

$ n and <f> n will occur in the general wave functions Sa n n , Sa n '^ n with 
coefficients not exceeding 1. Or, if w ' = a w n , ^ w ' = tt n '^ w > 

(13-145) 

The foundation of this exclusion principle will be investigated in Chapter 
xvi. Meanwhile we accept the general idea of such a principle from current 
quantum theory. 

The important point now arises that, since ifi n , <f> n are invariants, the con- 
dition (13-145) is not invariant for Lorentz transformations or other 
relativity transformations which change dV. Thus, if it applies at all, it 



232 Physical Applications [13-1 

applies to a special time axis. The particle interpretation is subject to the 
drawback, that by changing the time axis, the probability of the particle's 
existence in the domain V may become greater than unity ! This has been 
urged as an objection to the use of scalar wave functions; and Schrodinger's 
development of them has been condemned as non-relativistic. Briefly the 
criticism is that, in order that the probability <f>^dV of the particle being in a 
specified three-dimensional volume may be invariant for transformations 
of the coordinates, </>i[i must be a vector in the direction normal to dV. 
Therefore </> and ifi cannot be scalar quantities, but must be wave vectors 
whose product yields a space vector. 

Much of Schrodinger's theory is non-relativistic; but in this instance 
the condemnation is not justified. The criticism overlooks that, whereas 
(13*142) is a tensor equation, invariant for all relativity transformations, 
(13-143) is definitely not invariant for any continuous transformation, since 
it is restricted by the non-invariant condition that $ corresponds to the 
eigenvalue t^. By hypothesis the eigenvalues defining $ are discrete; 
otherwise no question of an exclusion principle arises. If we make an in- 
finitesimal transformation of the coordinate system, the adjacent values of 
t a p are impossible as eigenvalues, and $ does not exist. It is a consequence 
of the discreteness of the eigenvalues that (13-143) can apply only to special 
orientations of the axes. 

The physical reason is evident. Discrete eigenvalues arise from boundary 
conditions; and the components of the energy tensor which are quantised 
are those which have a special relation to the characteristics of the boundary. 
In particular, if the boundary is of constant form, it determines a space-time 
frame with respect to which it is at rest. The time component of the energy 
tensor thus becomes separated from the other components. 

We have said that (for the present) we base the general idea of an exclusion 
principle on current quantum theory. As formulated for atomic systems it is 
amply confirmed by experiment. Undoubtedly an exclusion principle of 
some kind is obeyed by macroscopic distributions, and it is important to 
discover its precise formulation. There has not been the same opportunity 
for experimental test,f and formulae are current which have not been 
quantitatively verified. It is therefore necessary to examine the principle 
critically. 

It is agreed that the unit wave functions (commonly regarded as repre- 
senting individual particles) are eigenfunctions of a certain number of in- 
dependent operators U^, and that the function is determined uniquely 
when the eigenvalues u^ are specified. Also the eigenvalues are discrete, 

f I think that the calculation, which we shall make later, of the constant of gravitation 
and the cosmical constant provides the first quantitative observational test. A qualitative 
test has been provided by the phenomenon of white dwarf stars. 



13-1] Standing Waves 233 

although in some applications they may be very numerous and fall close 
together. It is therefore impossible to vary an eigenstate continuously; 
and if the distribution is given a small velocity, as in a Lorentz transforma- 
tion, it must cease to be an eigenstate. It is therefore a necessary condition 
for an exclusion principle that it shall not satisfy the Lorentz transformation. 
Effectively this means that it applies to the relative or internal coordinates 
of a system as is illustrated in the application to the hydrogen atom. For 
a neutral macroscopic distribution the choice of operators U^ is very limited, 
since the only recognised characteristic of the distribution is T^ . In familiar 
applications <f> and are taken to be eigensymbols of the four diagonal 
components of T^\ so far as 1 can see, the only other possibility would be to 
employ derivatives (more especially curls) of T a p . 

For the most part, what we have said about Lorentz transformations 
applies also to ordinary rotations. But when, owing to the symmetry of a 
system, it is impossible to distinguish its orientation with respect to its 
surroundings a degeneracy occurs; and the discrete set of eigenf unctions 
<f>, ^ is defined by operators U^ which include angular momenta. 

There is no ground for supposing that an exclusion principle applies when 
the distribution is not in a steady state; and it seems evident that it does not 
apply. The energy concentrated in one elementary state of an organ pipe 
may (temporarily) be very much greater than hv ( 13-5). Consequently the 
principle cannot be extended from discrete to continuous wave functions, 
since the latter postulate an unbounded region in which equilibrium cannot 
be realised. When the eigenstates are very close together we may, for mathe- 
matical convenience, treat them as continuous, e.g. the states of a hydrogen 
atom just below ionisation; but this statistical continuity must be dis- 
tinguished from the genuine continuity of the eigenstates which begins just 
above the ionisation level. If the ionised hydrogen atom is in a finite en- 
closure, the free electron can attain a steady probability distribution; but 
the eigenstates have again become discontinuous owing to the boundary 
conditions imposed by the enclosure. 

The exclusion principle is often stated in the form that there is not more 
thanoneelectronperunitcell(A 3 ) of phase space. This is the most convenient 
form for statistical purposes, feut since it no longer explicitly requires the 
wave functions to be discontinuous, many physicists have assumed that it 
applies to systems represented by progressive waves, which are not in 
statistical equilibrium. Any proposal to extend formulae relating to statis- 
tical equilibrium to non-equilibrium conditions requires justification; but 
no such justification has been attempted, and there is neither theoretical 
nor experimental ground for the assumption. 

We add a comment on the association of particles with the unit wave 
functions. We are not attempting to analyse the macroscopic distribution 



234 Physical Applications [13-1 

into the elementary particles, or even the neutral particles, treated in micro- 
scopic theory. Any particles which we now introduce are average particles, 
having composite individuality; and as such they have different properties 
from the elementary particles. It is permissible to regard an electron of 
composite individuality as an electron, in the same sense in which it is per- 
missible to regard the Prime Minister as a human being although he is 
subject to changes of size and appearance impossible in a human being as 
biologically defined, and might undergo displacement from Lossiemouth to 
Bewdley without traversing any intermediate region. An average particle 
is the particle which is fulfilling a particular role; and the conception arises 
when there is in the assemblage always one particle, or a probability of one 
particle, playing that role; but the identity of the particle is continually 
(and, for statistical purposes, continuously) changing. We shall find later 
that it is impossible even in microscopic theory to assign identity to particles 
at different times in ah absolute way. But it is here unimportant whether 
the continuous identity which we assign to a microscopic particle is absolute 
or relative; adopting the microscopic particle as standard, the identity of 
the average particle associated with a macroscopic eigenfunction is com- 
posite and continuously changing. Naturally therefore it does not obey the 
same laws as a microscopic particle, in so far as the laws involve d/dt. Since 
d/dt gives the mass and energy, the mass and hamiltonian are different. 
There are, however, well-known theorems concerning the density of distri- 
bution of eigensolutions, which normally secure that the number of particles 
found in different modes of dissection is invariant. We are therefore able to 
state the number of elementary particles corresponding to a given number 
of average particles. 

13-2. The Box Problem. 

Consider a rectangular block of matter (e.g. in the interior of a star) of 
dimensions Z x x Z 2 x Z 3 , and containing al^LJi^ electrons. We wish to deter- 
mine the minimum electron pressure P as a function of the electron density a . 
We take axes such that the block as a whole is at rest, and we analyse the 
internal state of the block into standing waves in the relative coordinates 
i & fa- This is done by resolving the scalar wave function (within the 
block) into Fourier components, so that ^ ^^n v n v n^ where 



(13-21) 



"2 



n n -n cos -^ COS 

n v n v -n 8 ^ j 



etc., and n l9 n 29 n$ are integers (positive in (13*21)). The factors can be cosines 
or sines, and (as a convention) we distinguish sine factors by negative values 



13-2] Standing Waves 235 

of 7i. The condition governing this analysis is that the wave functions 
defining the elementary states must form a complete set of orthogonal 
functions capable of representing any distribution of $ within the block. 

Let s* a =27ra a /J a , w * = w ^ + w f + tu 3 2 . (13-22) 

Taking w l9 w%, w 3 as rectangular coordinates, each elementary wave is 
represented by a point in ro-space. The points form a rectangular lattice 
with intervals 27r/Z a ; and their density in to-space is / 1 Z 2 / 3 /(27r) 8 . By the 
exclusion principle there are not more than two electrons (and two protons) 
to an elementary scalar wave.f Hence we shall require 0^/2^3 waves, 
which will occupy a minimum volume i (27r) 3 a in ro-space. If this volume 
forms a sphere of radius r, 4^3 = | (27r )3 a . (1 3 . 2 3) 

The pressure is given by the operator 

P = J (T u + T 22 + T 33 ) = - V 2 /3w (13-24) 

by (13-12). For an elementary wave *l>n v n v n^ ^^is reduces to an eigenvalue 



by (13-21) and (13-22). Hence 

Py 2 Z 3 = S w*/3m = oi^Zg .o*/3m, (13-25) 

where ra 2 is the mean value of to 2 . We have therefore to select the waves 
which give the minimum value of w*. This is obtained by packing them in 
the sphere of radius r above mentioned. Then 



(13-26) 

by (13-23). Hence, by (13-25), 

D _l/3\t(fcr)i 



In C.G.S. units the factor (27r) 2 in the numerator is replaced by A 2 , our 
present units being such that A/2?r is unity. (The coefficient of the momen- 
tum operator is A/2?r, not A/TT, since $ is an internal wave function.) The 
constant m is identified with m e . 

This is called the ordinary degeneracy formula. It was first applied by 
R. H. FowlerJ in his investigation of the state of matter in white dwarf stars 
in 1926. But for many years it has been discarded by astronomers in favour 
of a supposed relativistic degeneracy formula. The " relativistic " degeneracy 
formula appears to be without foundation. The difference is important in 
the theory of evolution of white dwarf stars; and it was the paradoxical 
results of the "relativistic" formula, disclosed in an investigation by 
S. Chandrasekhar, which led me to examine its validity. || 

t We take this result from current theory. 

i Monthly Notices, R.A.S. 87, 114. 

Ibid. 95, 207. 

|| Ibid. 95, 194; 96, 20. See also 13-8. 



236 Physical Applications [13-2 

From a theoretical standpoint it is of greater interest to take the material 
to be ionised hydrogen, and calculate the whole pressure including proton 
pressure. The separation of the electron pressure from the proton pressure 
is somewhat artificial, since the electrons could not exist alone. If a is the 
number of scalar particles per unit volume, and ra the mass-constant of a 
scalar particle, i.e. the constant ra in (13-12), the minimum pressure is found 
by the foregoing method to be 



Suppose that I == ^ ==2 ( + ) (13-291) 

^ ra ra \m p mj 

by (12-63). Then (13-28) becomes 



Since the electron density is now 2<j, this agrees with (13-27) supplemented 
by a corresponding proton term. 

We therefore conclude that the coefficient ra for a scalar wave function 
is as given in (13-291). This is in the system of units with A = 2rr. We recall 
that when the relativistic double valued wave vectors are employed, the 
units are such that h = TT ( 9-6). In that case 

ra = 2ra /136. (13-293) 

The factor 136 is a transformation factor due to our normal reckoning of 
mass being determined by double vector wave functions with a phase space 
of 136 dimensions, whereas the double scalar wave function has a phase 
space of one dimension. We could eliminate it by assigning appropriate 
indices to scalar and vector wave functions (8-65). The factor 2 occurs 
because here the internal coordinates f ^ are relative to the centre of mass, 
whereas in the simple transformation (10-911) ^ is the coordinate of one 
particle relative to another. 

13-3. The Energy of Standing Waves. 

The domain of the internal coordinates ^ is three-dimensional, there being 
no relative time coordinate. Consequently T^ for standing waves consists 
of spatial components only. This is not the whole energy tensor; the re- 
mainder is provided by the external wave function which represents the 
motion of the block as a whole. The rest-mass is contained in the external 
wave function. In the present coordinate system (chosen so that the block 
as a whole is at rest) the external energy tensor consists of a single com- 
ponent 2V Since f^T^^T, we have in our coordinate system (with real 

time) Tu-T + Tu + TB + Ta, ( 13 ' 31 ) 

which is also written as / >=p + 3P. (13-32) 



13-3] Standing Waves 237 

Consider now a different set of standing waves in the material of the 
block, so that the pressure P is changed. One or both of the quantities />, p 

must be changed. Let 
8 



(13-33) 

where /> ' is constant, and /? may be zero or constant or a function of P. The 
usual point of view is that it is more appropriate to divide the whole energy 
tensor into a constant external part T^p^, and group together the part 
(P, P, P, (3 + j3) P) which comprises the whole effect of the standing waves. 
To provide for the energy (3 + /?) P we must insert a time factor e iki in the 
functions *ff n t n % n , k being dependent on (n l9 n 2 , n 3 ) or equivalently on 
(w l9 m 2 , iu 3 ). Owing to our special choice of axes t is identical with the 
dynamical coordinate s of the internal state; so it is not so incongruous as 
it might seem, to mix it with the internal coordinates f^. There are no 
boundary conditions for determining k, and it must be determined from the 
energy density (3 + j8) P; but this requires a knowledge of /?. 

The problem of finding the hamiltonian of the standing waves, i.e. the 
expression for the energy operator id fit in terms of the momentum 
operators id/dg^, thus resolves itself into determining the constant or 
function j8. We wish to find the change of energy density or mass density 
85P 44 , when energy is added to the system in the form of standing waves 
producing a pressure ST U . 

It is commonly taken for granted that the answer to this problem can be 
checked by observation. No such test is possible; and any answer we may 
adopt must rest on convention, not observation. Consider a vessel con- 
taining gas. I do not doubt that when the pressure is increased by raising 
the temperature of the gas, the mass (measured in the ordinary way by the 
acceleration of the vessel under a given applied force) is increased by the 
amount of the heat energy that is added. Further, if the gas is monatomic, 
its heat energy is fully represented by standing waves. But for our purpose 
the experiment is illusory. The integrated pressure of the gas is precisely 
balanced by the integrated tension of the walls of the vessel. The observed 
change of mass (integral of 8jP 44 ) is therefore not associated with any net 
change of T U9 but with a differential effect depending on whether T n is in 
a gas or a cohesive solid. The complication in the solid is that there are inter- 
atomic forces of cohesion, of a type which can only be represented by the 
use of vector wave functions. We can make an arbitrary addition ST^ = o&T n 
(a = const.) to the density at each point in the gas and vessel, since it is 
only possible to test observationally systems in which pressures and tensions 
balance. Accordingly, ^8T n = 0; and the addition cancels out on integration 
over the system. 

It may seem more hopeful to examine a steady system without a con- 
straining boundary, e.g. a star cluster, or a star composed of monatomic gas. 



238 Physical Applications [13-3 

But has a star greater or less mass than the sum of the rest masses of its 
particles? It has more kinetic energy but less total energy. Which of these 
corresponds to the mass? Or, more precisely, which is represented by the 
integrated value of 5P 44 ? The answer to the latter question is neither ; for 
jP 44 cannot be integrated in an absolute way in a curved space. In trying 
to avoid a constraining boundary we have used curvature of space-time 
(gravitational force and potential energy) to keep the system steady; and 
the same indeterminacy now appears in the form of non-integrability . 

This observational indeterminacy is provided for in the initial formula 
(13-11). Tp V is indeterminate to the extent of an additive constant Agr^/, and 
the relation of S!F 44 to 8T n will depend on the arbitrary choice of a corre- 
sponding 8A. 

We have therefore to fix the relation between 8jP 44 and 8P by a convention. 
For reasons, which we shall presently explain, the convention is taken to be 

ST = 0, (13-34) 

so that j8= 0. This will fix the change (if any) of the gauge constant A, and 
it can therefore be regarded as a gauging equation. It is here asserted only 
for changes of standing waves; in particular it does not hold for regions in 
which there is unbalanced angular momentum. But it happens that it 
includes Maxwellian electromagnetic fields since these have a proper density 
T which is identically zero. 

The invariant T is the Action of the material system. The action of electro- 
magnetic waves (but not of aperiodic electromagnetic fields) is zero. Hence 
(13-34) secures that there is no change of action when radiation is absorbed 
and converted into standing waves in a material system. It may therefore 
be interpreted as a Principle of Stationary Action for variations of the equi- 
librium state of matter and radiation. It is in the form of an action principle 
that the convention (13-34) has become incorporated fundamentally in the 
current scheme of physics. 

We must therefore accept (13-34) as the current convention. Then j8=0; 
and, by (13-32) and (13-33), the energy density T 44 of matter whose internal 
state is represented by standing waves is 



(13-35) 
where p is constant. 

In the classical theory of gases, and in elementary quantum theory, 
only the kinetic energy, which is approximately fP, is considered; and 
the energy density is taken to be p=/> +fP. But changes of pressure 
cannot be produced without changes of the gravitational field ; and it is 
disastrous to introduce relativistic refinements without taking into account 
the changes of potential energy. 



13-4] Standing Waves 239 

13-4. The Use of an Action Principle. 

Wave mechanics is a statistical theory and its results refer primarily to 
systems in statistical equilibrium. Its procedure is to investigate the pos- 
sible steady distributions of probability, postulating an ideal environment 
of the system considered. Its dynamical equations are derived from the 
condition that a recognisable characteristic of the system remains steady 
this being taken as the criterion of a steady state. Herein lies the essence of 
the statistical method. For the complexions of a system originally regarded as 
distinct are regrouped according to the values of the selected characteristic; 
and the probability distribution of its components takes the place of the pro- 
bability distribution of the original classification. In particular, it is possible 
to find attributes for which, on the original basis of statistics, some values 
are infinitely more probable than others corresponding to singularities 
in the transformation from the old to the new classification of complexions. 

But how can a theory of steady conditions provide anything for obser- 
vation to get a grip on? Those influences from the external world which 
reach our senses are due to change and transition. 

We have made provision for perturbation of and by these steady systems. 
The first step is to introduce steady perturbations. These are found by treating 
the perturbed and perturbing system as a combined system in statistical 
equilibrium, and therefore falling within the scope of the statistical theory. 
We then derive an equivalent representation as two separate systems each 
of which is uniformly perturbing the other. The perturbations are expressed 
as changes of the probability factors attached to the steady states of the 
two systems. 

But it would seem that ultimately there must be some limit to the treat- 
ment of phenomena by methods based on the postulate of statistical equi- 
librium. The universe is far from statistical equilibrium; so that sooner or 
later we are bound to overstep the limits of the theory. I am not sure that 
this conclusion is logically sound. According to our usual outlook the 
universe is far from statistical equilibrium; but it may depend on how we 
choose the basis of statistical enumeration. The "recognisable character- 
istic " J of the universe is that it conforms in every detail to our accumulated 
knowledge of what has actually occurred during a period of a few thousand 
years. We can at least say that J is constant for changes of the dynamical 
time coordinate 8 except in so far as subjective influences (discovery that 
certain information is false) may cause " perturbations". But it is perhaps 
unlikely that the number of symbols U l9 U 2 , ... commuting with J is suffi- 
cient to justify an analogy with the theory of 9-1. 

Be that as it may, physics does not attempt to press the equilibrium theory 
to such an extreme, but breaks off in a new direction. We must make clear 
the nature of this fundamental departure. 



240 Physical Applications [13-4 

A simple illustration is afforded by the historical development of the law 
of gravitation. First the steady states of a combined system the sun and 
a planet were discovered, leading to the formulation of Kepler's laws. 
Then an equivalent representation as two simple systems, one perturbing 
the other, was found; and the perturbations were expressed in the form of 
the inverse-square law of gravitation. The next step was to assume that 
the same law of perturbation applies, whether the bodies form a steady system or 
not. The Newtonian and Einsteinian pictures of gravitation are such that it 
seems pedantic to emphasise the arbitrariness of the last step. What possible 
bearing can the steadiness of the system have on the matter? But our point 
is (1) that an analogous step must always be taken in developing the general 
laws of nature from the study of steady states, and (2) it is an exceedingly 
dangerous kind of generalisation to apply to statistical formulae. We know 
well the many fallacies which have arisen from applying to non-equilibrium 
distributions the laws found for statistical equilibrium. 

We have therefore to recognise that, woven into the method of physics, 
and forming an indispensable part of it, there is a hypothesis or an assump- 
tion or a convention (we leave the appropriate term for further consideration) 
that results obtained for systems in statistical equilibrium can, in certain 
circumstances which must be strictly defined, be applied to non-equilibrium 
systems. 

The hypothesis or assumption or convention is the Principle of Stationary 
Action. This asserts that a certain characteristic (action) of the combined 
system in statistical equilibrium remains stationary for small deviations 
from equilibrium. This gives, as it were, a slight play at the joints of our 
systems, by which we can extricate them from the bondage of statistical 
equilibrium. After analysing a distribution in statistical equilibrium into a 
number of separate, but mutually perturbing, systems, we can give those 
systems a freedom which they did not possess as components of an equi- 
librium distribution. 

The introduction of some such principle is not a wholly arbitrary pro- 
cedure. Some such "loosening of the joints" is inseparable from the con- 
ception of the analysis of a whole into its parts.f It is meaningless to write 
a = b + c unless we contemplate the possibility that b may have a significance 
when c is not added to it. As a condition for detaching b from c, we must 
recognise a definite distribution of the characters of a between 6 and c. 
Thus an important aspect of the principle of stationary action is a localisa- 
tion of the characteristics energy, spin, etc. of the combined system. 

Let us return to the problem of determining observationally the change 

t Of. the introduction of an infinitesimal element of relative time dt r in 12-3. It is there 
postulated that the probability distribution of the combined system in phase space is 
stationary for such a variation. 



13-4] Standing Waves 241 

of energy with pressure of a gas. To render the conditions static and thereby 
amenable to treatment by the statistical theory which contains our funda- 
mental definitions, it was necessary to enclose the gas in a vessel. But the 
enclosure introduced compensating tensions which frustrated us. Having 
introduced the envelope, we were unable to detach the gas from its envelope. 
As an alternative we may consider a small volume of gas in the interior of 
a star, and seek to determine by ideal observations how its energy would be 
changed by a change of pressure. The question is absurd; we cannot change 
the pressure at one point in a star (in steady conditions) without altering the 
whole star. Observation will not tell us what part of the whole change of 
energy is located in the particular volume considered. 

To attach an observational meaning to a local association of energy 
density and pressure, we must be able to produce a change whose effects are 
confined to the locality. Such a change is provided by the conversion of 
radiation into molecular motion of matter. According to (13-34), if the 
radiant energy is converted wholly into standing waves, as in an ideal mon- 
atomic gas, there is no change of energy density or pressure;! 8O that the 
effects are confined to the locality. In particular the gravitational field 
emanating from the region is unaltered, so that there is no cause of readjust- 
ment of the matter outside. But there is the preliminary objection that the 
existence of radiation in any other than its equilibrium proportion pre- 
supposes a highly disturbed state of the star. Thus even the conversion of 
radiation into molecular motion is not a strictly local phenomenon. Its 
effects are local, but its causes are not local. It cannot occur independently 
of a general settling down of the star, implying similar conversions in other 
regions. 

It is here that the principle of stationary action, which must now have 
the definite form (13-34), steps in. It asserts that 8 T = applies to the local 
conversion of radiation into material energy, independently of the con- 
versions occurring in other parts of the star. Although the rest of the star 
could not actually be in statistical equilibrium, it may be treated as if it 
were in statistical equilibrium; because the property with which we are 
concerned is stationary for small deviations from statistical equilibrium. 

The mathematical form of the action principle shows quite explicitly that 
it is a means of localising the characteristics of the universe. The quantity 
to be varied is an integrated quantity covering a large volume; the quan- 

t This appears to be inconsistent with the elementary formulae, which predict a change 
of pressure if the volume is unchanged; hut in our problem we have to admit whatever change 
of volume is necessary in order that the effect of conversion may be strictly localised, i.e. 
that there may be no change of the gravitational field outside the region. But a real change 
of volume would displace the surrounding matter outwards or inwards; the change must 
therefore be represented as a change of reckoning of volume, implying a change SA of the 
gauge-constant. 

ETP 16 



242 Physical Applications [13-4 

titles determined or defined by the variation are functions of position so 
that each minute part of the system is described separately. Localisation is 
an artificial conception in an interrelated universe, where the influence of any 
one part extends through the whole. To detach one part from the rest is 
impossible. The action principle reassures us by asserting that (with suitable 
safeguards) a mild impossibility is a permissible idealisation. Its ill effects 
are of the second order of small quantities. 

Like many of the "principles" in science, the genesis of the action prin- 
ciple is that, having realised that we ought not to make a tempting assump- 
tion, we erect a principle to say that we may make it. I shall not attempt to 
defend the principle of stationary action as a physical assumption; there is 
no need to do that. Its defence is that it is the basis of our definitions. A 
principle of localisation must precede the definition of any localised entity, 
e.g. the g^ v or F^ of field theory. By definitions I here mean, not the mathe- 
matical definitions as symbols in a deductive theory, but the definitions by 
which they are recognised and measured in observational science. The action 
principle is not a physical hypothesis; it is a means of defining localised 
quantities. Thus in the application in 13*3 we had an undefined disposible 
constant A. By adopting the action principle ( 1 3- 34) we remove the arbitrari- 
ness of 8A, and changes of energy and pressure are defined precisely. 

In considering the action principle as a vehicle of definition, it is desirable 
to distinguish between : 

(a) Weak action principles. Referring to conditions in empty space, with 
or without weak electromagnetic fields. 

(6) Strong action principles. Referring to conditions in continuous matter, 
intense electromagnetic fields, the interior of a nucleus or electron, etc. 

The difference arises because, although an investigator has a certain 
amount of freedom in adopting definitions of the terms which he employs, 
he must have regard to established usage. In regard to (a) established usage 
dictates the form of the action principle. There is no difference of opinion 
as to how the kernel of the action invariant is constituted, to the order of 
accuracy required in weak conditions; for it has to lead to definitions already 
recognised. In regard to (6) an investigator has entire freedom in his choice 
of action invariant and consequent definitions of g^ v and F^, provided only 
that it converges to the weak action invariant as weak conditions are 
approached. Current literature shows that full advantage is taken of this 
freedom ! 

Observational measurements are only made under weak conditions. The 
"distance between two points" in field-free conditions is what it is deter- 
mined to be by measurement; the distance between two points in an intense 
magnetic field is whatever (within reason) a theorist chooses to call it. No 



13-4] Standing Waves 243 

experimental physicist would attempt accurate measurement of distance 
in an intense magnetic field; he would mistrust his scales or other apparatus. 
In practice he does not measure the curvature of the track of an electron 
in an intense magnetic field; he measures the curvature of a track on a 
photographic plate placed outside the field. Intervals, and the corresponding 
g^, are only defined observationally in field-free conditions, or when theory 
(that is to say, the accepted action invariant) assures us that the electro- 
magnetic field is not strong enough to introduce sensible error. When once 
we begin to suspect that our measured distances need correction, we turn 
to theory to tell us what the measures ought to have been. Naturally the 
corrected measures will confirm the theory used for determining the cor- 
rections, whatever form that theory may take. 

The literature of mathematical physics abounds in proposals of universal 
action principles covering strong as well as weak conditions. I think it is not 
unfair to summarise the majority of such proposals as follows: 

(1) A geometry based on a new set of axioms is outlined. 

(2) A fundamental invariant of the new geometry is selected as action 
invariant. 

(3) It is shown that the action principle leads to the usual equations for 
weak fields. 

(4) Second order terms, sensible in strong electromagnetic fields, are 
found, which it is suggested may provide an observational test of the theory. 

(5) A hope that something further may come of it. 

Our comment is: (1) We may choose any kind of space we please for the 
purpose of graphical representation of physical quantities without com- 
mitting ourselves to any thing, f (2) The quantities g^, Jf^ are to be used in 
strong conditions in which there is no agreed definition as to how they are 
to be measured. The action invariant embodies the definition selected in the 
new theory. (3) This shows that the definition is not in conflict with any 
established usage. (4) The result of the test will inevitably be a triumphant 
verification of the new theory, provided that the observations are correctly 
reduced. By correctly we mean that, for measurements with apparatus 
actually located in the intense fields, the readings of the apparatus (which 
are, of course, affected by the field) are corrected to accord with the readings 
of ideal apparatus which conforms to the equations of the new theory; and 
for measurements made outside the fields, the inference as to what is 
happening within the field is calculated according to the equations of the 
new theory. 

The trouble about unified field theories is that there are so many of them, 
and all of them are right. The various action principles are various plans of 
t Mathematical Theory of Relativity, 83. 

lC-2 



244 Physical Applications [13-4 

localising the characteristics of the interrelated conditions which the 
universe presents to us. The procedure of localising an entity is inseparable 
from the procedure of defining a local entity. It all reduces to a question of 
definitions. The difference between the rival theories really rests on (5) 
as to which opinions will naturally differ. 

13-5. Potential Energy. 

Let us return to the problem of the rectangular block. In order to avoid 
inessential complications, we shall at first suppose the density to be very 
low, so that collisions of the particles can be neglected. From one aspect the 
block is occupied by the set of standing waves introduced in 13-2. From 
another aspect it is occupied by n elementary particles moving in diverse 
directions; these are represented by progressive waves which combine into 
an w-tuple wave function. I think it will save much confusion of thought if 
we notice that the standing wave function represents a system in equi- 
librium, and the n-tuple wave function represents a rapidly dispersing 
system. If we insert in the n-tuple wave function a later value of the time, 
it gives the distribution which would be reached if each particle continued 
to move with its present momentum. 

Let us then take a somewhat modified ra-tuple wave function Y, which at 
every time t represents the n particles occupying the block at that time. 
This will represent a statistically steady system, just as the standing wave 
function does. Every time a particle crosses the boundary T changes dis- 
continuously.t We must replace the discontinuities by continuous change; 
otherwise the function is not differentiate and is useless for computing 
energy. But the energy id/dt of this smoothed wave function will be 
altogether different from that of the instantaneous n- tuple wave function. 

The general effect of smoothing out the discontinuities can be seen by 
considering the combined wave function of the rectangular volume A and 
the surrounding matter B which holds it in equilibrium. The steady state 
(represented by standing waves) results from the fact that on the average 
for every particle passing from A to B 9 a similar particle passes from B to A . 
In the combined wave function of A and B this is merely a nominal inter- 
change. The coordinates x p of the pth particle of system A have become 
inappropriate to system A, and the coordinates x q r of the qth particle of 
system B have become inappropriate to system B\ we therefore re-label the 
particle at x p as the qth particle of system B, and the particle at x q ' as the 
pth particle of system A. By the well-known Fermi-Dirac rule this inter- 
change of labels reverses the sign of the wave function in this case the 

t Since we are supposed to have approximate knowledge of the momenta of the particles, 
we cannot know their positions exactly. Thus the time of crossing the boundary is indefinite 
to a slight extent; and the change of T though erratic is not strictly discontinuous. 



13-5] Standing Waves 245 

combined wave function of A and B. In time-averaging the succession of 
reversals of sign is replaced by a continuous factor e'^', which reverses 
the sign at regular intervals irjp v . When we form the energy -i9/9f, 
the factor e ip ** gives an additional energy p v . This interchange energy 
does not appear in the instantaneous n-tuple wave function which repre- 
sents a dispersing system; but it belongs inalienably to the steady system 
represented by the modified function Y or alternatively by standing wave 
functions. 

Interchange energy is evidently potential energy. (I may add that so far 
as we know all potential energy is interchange energy.) The system occupy- 
ing the rectangular block may be kept in equilibrium, either by interchange 
of particles at the boundary, as described above, or by a force exerted by the 
matter beyond the boundary turning back the particles as they reach it. 
But the two processes are the same. For, since the particles are indistin- 
guishable individually, it is meaningless to discriminate between an old 
particle being turned back and a new particle coming in. Thus the inter- 
change effect and the force are identical. If we happen to take one view we 
call the energy interchange energy; if we take the other view we call it 
potential energy in the field of force. 

The rectangular block, which we have been contemplating, must not be 
too small to be treated macroscopically. In many applications the free path 
of the particles will be small compared with 1 19 1 2 , 1 B . The collisions are then 
the main instrument in preventing the particles from straying outside the 
block; the reduced number which hover about the boundary are deemed to 
be turned back by interchange as before. We shall find in Chapter xv that 
the electrical forces controlling the encounters of protons or electrons are 
attributable to interchange. For our purposes it is unimportant whether 
the potential energy included in !T 44 is due to collision interchange or to 
boundary interchange. The formal difference is that collision interchange is 
provided for in the equations and definitions of electromagnetic theory, so 
that it is an intrinsic property of electrons and protons as currently defined; 
boundary interchange is an averaging adjustment which enables us to treat 
a dispersing system as a static system. We may say that when the free path 
is small compared with a macroscopic volume element, the volume element 
is genuinely in equilibrium; when the free path is long, it can be treated as in 
equilibrium. 

When the velocities of the particles are not too great, and at the same time 
the density is too low to give rise to degeneracy, the classical theory applies 
and the density of the kinetic energy is f P. To make up the total p + 3P, the 
potential energy must be f P. Thus the kinetic and potential energies are 
equal. The fact is that the waves represented by scalar wave functions are 
the ordinary elastic vibrations or sound waves of the material. The only 



246 Physical Applications [13-5 

point at which we go beyond classical dynamics is when we apply the exclu- 
sion principle. 

The exclusion principle sets an upper limit to the amplitude of any 
particular wave, such that its energy cannot exceed one quantum. In non- 
equilibrium conditions the amplitude of a particular wave can greatly exceed 
this limit, e.g. when an organ pipe is sounding its fundamental note. This 
emphasises the fact that there is no justification for applying the exclusion 
principle in non-equilibrium conditions. It is intimately connected with the 
interchange forces which maintain equilibrium, as will be seen in Chapter xv. 

In extreme conditions the kinetic and potential energies are no longer 
equal. For constant density, it appears that the energy 3P is wholly poten- 
tial at absolute zero (complete degeneracy) and tends to become wholly 
kinetic at very high temperatures. We should first make sure that this 
separation into kinetic and potential energy has an observational meaning. 
Certain phenomena, e.g. the rate of disruption of nuclei by protons, evi- 
dently depend on the kinetic energy of the elementary particles (not the 
composite average particles); and since the rate of disruption increases 
rapidly with the speed of the protons, the distribution law of kinetic energies 
of the individual protons is involved, and doubtless will in due time be 
ascertained from observations of this kind. The problem of finding this 
distribution law theoretically should be soluble; but apparently it has not 
yet been solved. The individual protons and electrons correspond to pro- 
gressive waves; the present investigation treats only the composite particles 
represented by standing waves, and is not relevant. 

Reference must be made to the current formulae (associated with the 
"relativistic " degeneracy theory) which profess to give the distribution law 
of the kinetic energies and momenta of the individual protons and electrons 
at all temperatures. So far as I know, they may be right at high tem- 
peratures; but it is impossible to trust them, not only because they rest on 
a false conception of relativity, but because they are clearly wrong at low 
temperatures. In these investigations the minimum pressure P which 
corresponds to absolute zero is attributed to kinetic energy of the elementary 
particles. If that were so the protons would, if the density were great enough, 
still have large velocities at absolute zero, and continue to disrupt the 
atomic nuclei. This surely is a contradiction of thermodynamical principles. 
Our own investigation does not discriminate between kinetic and potential 
energies; but coupled with the general physical principle that processes of 
the nature of ionisation or transmutation must cease at absolute zero, we 
infer that the energy 3P must be wholly potential. 

It may be urged in defence of the current formulae that the energies and 
momenta to which they refer are not to be taken as the energies and 
momenta for the purpose of calculating collision effects that they are not 



13-6] Standing Waves 247 

to be interpreted according to the classical conception of motion. To this 
we reply: (1) Then the distinction between kinetic and potential energy 
loses all observational meaning; and we lose nothing by treating them 
together in our own theory. But the question still remains unanswered, 
What is the distribution law of the energies and momenta concerned in 
collision effects? (2) Our criticism of the current theory is that it does 
interpret the energy |P according to the classical conception of motion; it 
treats the corresponding velocity or momentum as a vector to which Lorentz 
transformations may be applied, and thereby introduces change of mass with 
velocity. If the particle has one velocity for the purpose of Lorentz trans- 
formations and another velocity for the purpose of collision phenomena, 
what becomes of the principle of relativity? 

13-6. Transition to Vector Wave Functions. 

b 

For the problem of the rectangular block we have used a form of wave 
mechanics much simpler than that which has been developed in previous 
chapters for the treatment of microscopic problems. If it is asked why the 
simple form does not apply to an atom, we answer that it does not even 
apply to a macroscopic distribution of matter in a smooth spherical vessel. 
In addition to the modification of the elementary orthogonal functions 
^n lf w 2 ,w 8 to correspond to a spherical boundary, a new feature appears. 
There are steady states in which the distribution has a resultant angular 
momentum. 

With a spherical boundary unidirectional motion is no longer inconsistent 
with a steady state. In rectangular coordinates there must be on the 
average as much motion in one direction as in the other direction of f a , and 
the state of motion is accordingly represented by standing waves; but in 
polar coordinates it is not necessary that there should be as much motion 
in one direction as in the other direction of 0, so that we may have pro- 
gressive waves in an angular coordinate 0. Such waves are not essentially 
different from the progressive waves representing a particle moving freely 
in space, i.e. revolving unidirectionally about the centre of curvature of 
space. 

The advent of progressive waves into a problem depends on the existence 
of relativity transformations of the coordinates, which occasion a degeneracy 
of the ordinary steady state solutions. In the problem of the rectangular 
block there are no relativity transformations of the internal coordinates 
M ; taking account of boundary conditions, no other axes are "equivalent" 
to those which are parallel to the edges of the block. In the spherical problem 
all orientations of the internal rectangular axes are equivalent; so that 
rotations in the three coordinate planes are relativity rotations. The time 
direction must be chosen so that the spherical boundary is at rest; it is 



248 Physical Applications [13-6 

therefore still unique, and there are no Lorentz transformations of the 
internal coordinates. 

The machinery for dealing with relativity transformations has been 
developed in our early chapters; and in order to provide for the relativity 
transformations of the internal coordinates we must introduce it into the 
internal wave functions. But for three-dimensional relativity it is un- 
necessary to resort to Dirac's fourfold wave vectors. We can take $, <f> to be 
wave vectors with two components (2-vectors), and associate the coordinate 
planes with Pauli matrices 1? 2 , 3 . As explained in 3-8, 19 2 , 3 , i form 
a minor complete set, and are a simplified representation of the symbols 
E 12 , E 23 , EM, E 19 when the other symbols are not being used. 

Considering an elementary scalar wave, let a be the particle density, or 
the probability that the particle belonging to the wave is within a unit 
volume at the point considered. If 0, <f> are normalised in accordance with 
the exclusion principle (13-144), <r = 0<. Thus scalar wave functions relate 
to the special case in which the stream strain vector /S = ^&* consists of a 
single algebraic component a. But when $ and are 2-vectors, an algebraic 
quantity a cannot be a simple product ^*; a is the sum of two pure com- 

P nent8 



where L is the Pauli matrix associated with an arbitrary plane. The two 
expressions in (13-61) are idempotent when normalised (so that the halfspur 
= ); hence they can be factorised ( 5-6). 

Accordingly the "particle" corresponding to an elementary scalar wave 
is replaced by two sub-particles with the non-algebraic stream vectors 
(13-61). The components Jo- and ^icr^ correspond to energy and angular 
momentum. Thus the sub-particles have opposite spin in the ^ plane. When 
the boundary conditions are such that the angular momentum in a steady 
state is zero, the oppositely spinning particles are constrained to occur 
together; they can be regarded as together constituting an indivisible unit 
a scalar particle; and we are only concerned with the sum of their strain 
vectors which, being an algebraic quantity, is represented by scalar wave 
functions. But when, as in the spherical problem, integrals of angular 
momentum exist, the oppositely spinning particles have independent 
probability distributions and are represented by separate 2-vector wave 
functions. 

The analysis of a system into elementary states has to be considered in 
conjunction with the perturbations anticipated. The elementary state is a 
unit which requires a separate probability factor in perturbation theory. 
In earlier chapters we have had in mind well-isolated systems, whose reaction 
with other systems is limited to weak or occasional perturbations. But in 
these macroscopic problems the "system" described by the wave functions 



13-6] Standing Waves 249 

is usually a volume more or less arbitrarily carved out from its surroundings, 
and the viscous forcesf at the boundary constitute a strong permanent 
perturbing influence. We have different elementary states of a gas con- 
tained in a spherical vessel according as thesphere is rough or smooth, because 
in the rough sphere the permanent perturbation is such as to prevent any 
persistence of probability in one direction of spin as compared with the 
other; there is no occasion to give them separate probability factors. 

The choice of elementary states depends on the approximate rather than 
on the exact boundary conditions. A vessel cannot be perfectly smooth; and 
if we paid attention to the exact boundary conditions, we should conclude 
(correctly) that the only state of statistical equilibrium was one of zero 
angular momentum. But the gas may take a long while to reach this state; 
and we learn more by utilising the complete series of states in an ideally 
smooth vessel, treating the small friction as a perturbation which slowly 
transfers the excess probability of one direction of spin to the opposite 
direction. 

The problem of a smooth spherical vessel is a half-way stage between the 
rectangular block and a particle moving freely in space-time. For the free 
particle the relativity transformations are extended to include Lorentz 
transformations. This additional degeneracy must be provided for by a 
second set of Pauli matrices 0^, commuting with the ^. The two com- 
ponents of opposite spin ^<r(li^) are in turn analysed into two com- 
ponents of opposite electrical sign 

ior(lig(li^). (13-62) 

The wave vectors are now 4-vectors, and the double set of Pauli matrices 
is more conveniently replaced by a set of Dirac matrices with the same 
commutative relations (3-8). The four components (13-62) then become 
the spectral components given in (6-64). 

It is of interest to consider whether (13-62) has any application to macro- 
scopic systems. We might start with 0^ instead of ^, and divide the scalar 
density a into two components of opposite electrical sign Jo*(l tfljj. The 
scalar wave function postulates that these are constrained to have equal 
probability so that they need not be considered separately. Can the 
boundary conditions be modified so that they have independent probabili- 
ties? I think so. The necessary condition is that the system shall be in- 
sulated. With an insulated boundary, there are steady states having an 
excess of components of one electrical sign; just as with a smooth spherical 
boundary there are steady states having an excess of components with one 
direction of spin. 

t The boundary pressure (interchange force) is allowed for by including potential energy 
in the system, so that we avoid treating it as an extraneous perturbation. 



250 Physical Applications [13-6 

If the substance is permanently magnetisable it is also possible to have 
steady states with a preponderance in one direction of the magnetic stream 
vector. 

These states of electric or magnetic excess may also be induced by 
permanent perturbation from outside, i.e. by steady electric or magnetic 
fields. Thus the general problem of statistical equilibrium of a macroscopic 
system, when the angular momentum is not constrained to be zero and 
electric and magnetic fields are not excluded, will involve Dirac's vector 
wave functions. 

When the spins and charges are not balanced, the state of the material is 
only imperfectly described by the ordinary energy tensor. For a full 
description we require the Riemann-Christoffel matrix ( 11-4). But it should 
be remembered that the electric and magnetic fields which occur in practice 
in macroscopic systems correspond to an entirely trivial excess of com- 
ponents of one sign. So that whilst these phenomena are important on their 
own account, there is very little to represent them in the statistics. It is 
therefore more suitable for ordinary purposes to represent the electro- 
magnetic field separately by potential theory, rather than to merge it in 
the general statistical formulation where it would be almost lost. 

If we apply the energy operator (13- 12) to vector wave functions, the 
covariant differentiation will introduce matrices ( 8-3). Each component of 
the energy tensor therefore becomes a matrix, and we obtain altogether 
256 components as when the Riemann-Christoffel matrix is used. It appears 
therefore that the theory of the energy of distributions which involve vector 
wave functions, treated in Chapter xi, could be reached by starting with the 
energy operator (13-12). 

13-7. Origin of the Energy Operator.f 

In this section we shall obtain the formal connection between the energy 



and the energy operator 

T aj8 

By contracting the energy tensor we have 

T = (1/8 
Using this to eliminate A, we obtain J 



t As this section is occupied with ordinary tensor calculus the summation convention is 
used unrestrictedly. 
J I was led to consider #p J0 a p# through correspondence with Dr J. Ghosh. 



13-7] Standing Waves 251 

Consider the gauge-invariant curvature tensorf 

*#tf = #*/*- 3 (*J0 + (^ 
which is invariant for the transformation 

9*fi=Pffafi> K a ' = * a + a0/a* a , (13-722) 

where j8 is any scalar function of the coordinates, and 

= logj8. (13-723) 

Multiplying (13-721) by gr a #, we obtain 

*G=a-6(K*) e + 6K K e . (13-724) 

Hence ^~^^ = *^-^*^ + 2 K^-^ a j3^^ ) 

+ 2{(Oj-iM JC *)> + * l *' (13-725) 
where ^ aj8 = (/c a )^~(/c ]3 ) a . 

Take * a =0 initially; and let A denote an increment produced by the 
gauge transformation. Then, since *# a fa/ap* & + &&$ is invariant for the 
transformation, (13-725) gives 

Wtf-tartW-WJt-tartW + WJfi-tatfP).}, (13-731) 
where a denotes dO/dx^. 

If 8 denotes covariant differentiation 

l 
p 

so that (O/J-^W&fcfcgjB-fyfl.. (13-732) 

(i3>74) 

so that (0J j8 + d a ^=-47rK.)8- 1 T a/ ,/9 (13-751) 

and, by contraction, (0 e ) e + fl0, = - 4 . jS^T/S. (13-752) 

Then, by (13-71), (13-731), (13-751), (13-752), 

A(? T aj8 -|gr a ^) = ]8- 1 (T^-i^T) j 8. (13-753) 

We now introduce wave functions 

*-jB, X = J8-S ' (13-76) 

representing an addition to, or modification of, the distribution whose 
energy tensor is T^ . Our result shows that instead of expressing the modi- 
fication as a change of curvature of space-time (which would involve altering 
the metrical tensor gr aj8 to jS 2 ^), we can express the change of T^ ^g^T 
as the expectation value of an operator T a ^-|gr a ^T, the operator being 
defined by (13-74). 

This is the most primitive connection between the wave functions and 

t Mathematical Theory of Relativity, equation (87-5); the suffixes after the brackets denote 
covariant differentiation. Although this tensor is generally studied in connection with Weyl's 
geometry, we do not here apply it in that way. We use only the well-known analytical 
property of invariance of the tensor. 




252 Physical Applications [13-7 

operational momenta of wave mechanics and the curvature tensors which 
represent momenta, etc., in general relativity theory. 

Equation (13-753) can be resolved into separate equations: 

Afy= X T^ f Ato*T)-j^. x Tfc (13-77) 

provided that the change AA of the arbitrary constant A is suitably chosen. 
The two equations are consistent since 

A((^T) = A(<7 a ^ 

<7 a 0gr being unaltered by change of gauge. The second equation of (13-77) 
is therefore derivable from the first. 

The wave functions employed in practical applications of wave- 
mechanics are an adaptation of these primitive wave functions to specially 
simple conditions. In particular a modified energy operator is used, defined 
by V 32 

' (13 ' 78) 



whose eigenvalue is the integrated energy over a three-dimensional domain 
of volume V. Instead of ^r, x we take two equal wave functions 

A=H0. 

The product \^<j> bears the same kind of relation to ^^ that a strain vector 
bears to a space vector. Instead of (13-77) we have 



If now <f> and \f> are normalised so that f<f>ifidV= 1, we have 



and V . A T^ is regarded as the energy of a single particle distributed through- 
out V with a probability density <fn/t. 

Further light on the connection between wave functions and gauge 
transformations will be found in 14-1, where a special case is treated. 

Although we do not normally employ Weyl's representation of the 
electromagnetic field in wave mechanics, it is of some interest to extend the 
foregoing investigation to his theory. The difference is that we can no longer 
put * a = initially, since the electromagnetic force JP a ^, which is invariant 
for the gauge transformation, is represented by curl /c a . The term 



in (13-725), being non-linear, will give cross terms 

2( 
or, in operational form, 



), (13-79) 

representing a mutual energy of the waves and the electromagnetic field, 
which is not included in (13-74). 



13-8] Standing Waves 253 

13*8. The Degeneracy Formula. 

Since the formula (13*27) for the minimum pressure differs from the so-called 
relativistic degeneracy formula which has been widely but uncritically 
accepted, we shall here explain the origin of the difference. 

If a particle of proper mass m has a velocity vector dx^/ds, its "integrated 
energy tensor" is defined to be 

Uv>-i&^-. (13-811) 

ds da v ' 

Suppose that there is a discrete set of values of UP V , and that there are n r 
particles with energy Iff in a three-dimensional volume V. A fractional 
value of n r is to be interpreted as a probability. The resultant energy tensor 
(reckoned as usual per unit volume) is 

Tfi v = Z> r n r Us v IV. (13-812) 

Let N r = n r (dtlds) r . (13-82) 

Then N r /V is the proper density of distribution of the particles with suffix r 
as contrasted with the relative density n f /V. The energy tensor can also 



where M 

The exclusion principle provides an upper limit to the number n r or N r . 
The difference between the two theories is 

Present theory n r < 1, j 

Current theory N r ^l.]~ ( ' ' 

It will be seen that, if V = 1, the maximum contribution of any one state to 
TP V is on the present theory m (dx^/ds) (dx v /ds), and on the current theory 
m (dXp/ds) (dx v /dt); or in terms of momenta the contributions are ppp v /m 
and P^P V !PQ . In particular the contributions to the energy are, respectively, 



It is clear that N r has been used in current theory under the impression 
that (13-812) is a tensor equation. If each term on the right-hand side were 
a tensor, n r /V would be invariant for Lorentz transformations, and N r would 
be invariant. But the eigenvalues Uf of a tensor operator do not constitute 
a tensor, and (13-812) is not transformable. 

When once it is realised that the calculation of P is concerned with the 
random internal motions relative to the centre of gravity of the whole 
block, so that the time-axis is prescribed, it is difficult to see any reason at 
all for the introduction of N r . For standing waves the momentum does not 
reduce to an eigenvalue, so that the number N r does not exist. 

Since N r m=n r M, the substitution of N r for n r is from one point of view 
equivalent to introducing change of mass with velocity. The pressure in a 



254 Physical Applications [13-8 

star is of order of magnitude p(f>, where <f> is the gravitation potential measured 
from the boundary. If all the particles are of the same mass, v 2 is of the same 
order as ^; and the change of mass with velocity |mv 2 /c 2 is of the same order 
of magnitude as the gravitational potential energy m^/c 2 , whose effect on 
the mass is neglected. It is, however, assumed in the current theory that, 
although v 2 is of order <f> for protons and ions, it is very much higher for 
electrons, which thereby contribute the bulk of the pressure. But there is 
no reason to suppose that in the degenerate state the electrons have higher 
speeds than the protons. Equipartition of energy refers only to transferable 
energy. 

Investigators seem to have been misled by trusting to the classical picture 
of moving particles. When a particle is forced up into a state of high energy 
by the occupation of the lower states, the current theory pictures the energy 
in the classical way as translation with high velocity. It accepts this so 
literally that it supposes that the particle could be reduced to rest by a 
Lorentz transformation, and thereby calculates its change of mass with 
velocity. But non-transferable energy cannot logically be represented that 
way. We take it that the degeneracy energy is potential energy. There is 
then no correction for change of mass with the (non-existent) velocity; and 
the exclusion principle must be n r < 1, since N r is undefined. 

The exclusion principle was first recognised in the atom where it applies 
to the internal motion of a system in a steady state. In generalising it to 
macroscopic systems it would be unsafe to disregard this restriction. In 
any case the problem to which we here apply it relates to an internal system 
in a steady state. If we are not obsessed with the idea that the formulae must 
somehow be capable of extension to dispersing systems represented by plane 
progressive waves, or that they must be invariant for transformations of 
axes which would change the nature of the problem, the procedure is 
straightforward and unambiguous. The steady state distribution is repre- 
sented by standing waves. The choice between n r and N r is settled by the 
fact that N r is not only irrelevant but non-algebraic. As shown in 13-5 the 
steady condition involves a potential energy due to interchange, and for 
complete degeneracy the energy is wholly potential. Finally, we shall be 
able to test the theory in Chapters xiv and xvi, by using it to calculate the 
constant of gravitation and the cosmical constant. 

Originally my choice of n r was derived from the investigation in 10-9, 
where it was shown that the relative energy of a particle is not p Q but 
a ( jpj 2 +p 2 2 +2> 3 2 )/m. Since the law connecting pressure and density in a gas 
depends only on the internal motions, which must be separated from the 
mass motion of the gas as a whole, I could not feel satisfied with the current 
theory which took the energy contribution due to one occupied state to 



13-8] Standing Waves 255 

The present theory gives P Q oc or* at all densities. The current theory gives 
P oc / (a), where/ (a) changes gradually from or* to <r* as the density increases. 
This is especially important in the theory of white dwarf stars (in which the 
degeneracy pressure becomes large), because it can be shown rigorously 
that the pressure in a star cannot exceed Bp*, where B is a constant depend- 
ing only on the star's mass.f The curve P = Kv* giving a minimum pressure 
must, as a and p increase, eventually cross the curve P=JB/>^ giving a 
maximum pressure. The crossing point is an extreme upper limit to the 
density in a star of given mass; this limit is of order 10 7 gm. cm.- 3 for the 
sun's mass. The current theory on the other hand gives an upper limit to p 
only in the smaller stars; above a certain mass the curves no longer cross, 
and it would seem that as the star's energy supply gives out, it must go on 
contracting to ever higher density until the space becomes so much curved 
that the terms "contraction" and "density" lose all meaning. 

t Monthly Notices, R.A.S. 91, 444 (1931). We may take the density p to be an approxi 
mately known multiple of the electron density a. 



CHAPTER XIV 
THE COSMICAL PROBLEM 

14-1. Waves and Curvature. 

Without appealing to the detailed investigations in Chapters xi and xin, it 
is evident that the curvature of space-time introduced in relativity theory 
and the waves of "0" introduced in wave mechanics are equivalent. Both 
devices are used for the same purpose, to represent the distribution of mass 
and momentum of physical systems. Both are devices] it is not suggested 
that either the curvature or the waves exist in a literal objective sense. The 
method of waves gives the finer analysis, and is essential if we are treating 
systems on the atomic scale; but for the mechanics of ordinary macroscopic 
objects, and for astronomical and cosmical systems, either method is 
available, and either method should give the same result. 

But the result will be expressed in terms of different natural constants. 
When the relativity method is used, the principal constant involved is the 
gravitational constant /c. Wave mechanics does not introduce K, but uses 
Planck's constant h and the masses m p , m e of elementary particles. If we 
can find one problem tractable enough to be solved by both methods, we 
shall, by comparing the two answers, obtain a relation between the natural 
constants. I cannot but think that the realisation that a hitherto un- 
recognised relation exists that there will be at least one redundant con- 
stant when the theories are brought together is scarcely less important 
than the ascertainment of its precise numerical form. 

In this chapter we shall solve one problem by both methods, and so 
ascertain the relation between the natural constants. The system which we 
shall consider is a self-contained static distribution of material particles 
(protons and electrons) without radiation. By "self-contained" we mean 
that it requires nothing external to itself to hold it in equilibrium. In 
relativity theory the only completely static self-contained system is an 
Einstein universe. In quantum theory a system, which is static and radia- 
tionless, must be in its ground state. We shall therefore treat an Einstein 
universe, (a) by the ordinary relativity theory, and (6) by wave mechanics 
applied to a system of particles in the ground state. 

As in the preceding chapter, we represent matter in equilibrium by 
standing waves. But there will now be no artificial boundaries, and the 
waves extend throughout the universe. 

In stereographic coordinates the line element of an Einstein universe isf 
<fo2 = -(i+ r */4R*)-*(dx*+dy* + dz*) + dt* (14-11) 

t Tolman, Relativity, Thermodynamics and Cosmology, equations (138-4), (139-3), (139-4). 



14-1] The. Cosmical Problem 257 

and the pressure and density are found by the usual method to be 



(14-13) 

Instead of treating jR as a radius of curvature, we treat ft as a gauge factor. 
That is to say, we treat the stereographic projection as the true configuration ; 
the curvature is abolished, and #, y, z are rectangular coordinates in a flat 
space. The "true" element of length is then ids = (dx 2 + dy 2 +dz*)l, which 
is j8 times the measured length ids. Hence the true volume is j3 3 times the 
measured volume, and the true particle density a is /?~ 3 times the measured 
particle density a. The measured density of distribution of particles in an 
Einstein universe is uniform; in the new reckoning this uniform density a 
is replaced by GP O = /J~ 3 <7. 

Expressing the particle density cr in the flat space as the product of two 
equal scalar wave functions 0, 0, we have 



Prom this we calculate the eigenvalue of V 2 at the origin 

V 2 =-9/4K 2 (14-141) 

so that, by (13-24), P = %lR*m. (14-142) 

To obtain the expectation value P, we must multiply the eigenvalue (at the 
origin) by the probability that the entity represented by 0, is in unit 
volume at the origin. The whole volume of space in natural measure is 

F=27r 2 J2 3 . (14-143) 

Since the distribution is uniform, the probability associated with any unit 
natural volume is 1/F. At the origin j8= 1, and the two reckonings of volume 
agree. Hence p8/4RmF, P = 9/4# 2 mF, (14-144) 

by (13-35). 

Since we have represented the whole density cr by one pair of wave func- 
tions, we have treated it as the probability distribution of a single scalar 
particle. The formulae may be regarded as referring to an ideal one-particle 
universe; or, if applied to the actual universe, m is the mass-constant of the 
equivalent single particle. The extension to a system consisting of N 
particles which obey the exclusion principle will be treated in 14-2. 

In Chapter xni only part of the energy was represented by standing 
waves, the rest energy being taken care of by the external wave function. 
But here the whole curvature of the Einstein universe has been replaced by 

ETP 17 



258 Physical Applications [14-1 

wave functions i/f, < in a flat space-time. Thus the density p = 3P in (14- 144) 
is the whole of the density. 
Setting p = 3P in (14-12) we obtain 

A = f#- 2 , 87rKp = fJ&- 2 . (14-151) 

Let us now revert to the usual standpoint, and regard JB as a radius of 
curvature. The waves are then abolished; they were a representation of 
the projection factor j3 which is no longer used. Since there are no standing 
waves, P = 0. Inserting P = in (14-12) we obtain 

A' = J2- 2 , STTKP' = 2JR- 2 , (14-152) 

accents being used to distinguish this from the previous reckoning. We have 

. p'=$P- (14-153) 

The change of reckoning from /> to p' depends on the change of the 
arbitrary constant in the energy tensor from A to A'. We have seen ( 13-1) 
that this signifies a change of zero point from which energy and pressure 
are reckoned. When we represent the curvature by waves we introduce a 
pressure, uniform throughout the universe, which (according to our usual 
outlook) is fictitious. The change of zero point gets rid of this pressure, and 
at the same time changes p to p. 

The pressure (in ordinary reckoning) is not necessarily zero in an Einstein 
universe. Static universes with non-vanishing pressure are obtained by 
taking A slightly greater than J?- 2 . But we are here treating a radiationless 
universe, which is accordingly at zero temperature, so that the pressure 

is zero.f 

By (14-144) and (14-163), />' = 3/72^7. The total mass or energy of the 
Einstein universe in ordinary reckoning is therefore 

M'=p'V=*IR*m. (14-161) 

There is further the well-known relation, obtained from (14* 152) and (14- 143), 



Thus if M' or R is given we can determine the mass-constant w, which 
occurs in the definition of the energy operator. 

If there are two reckonings of m as there are of p and M 9 viz. m 1 = fw, 
(14-161) can also be written ^ _ 4/^2^ (14-162) 

The origin of the factor f is that, since we are treating static conditions, 
the gauge transformation is applied in three dimensions only. The gauge 
transformation in 13-7, which gave the most straightforward formal 

f According to (13*27) the pressure at zero temperature is not strictly zero. For the 
Einstein universe this residual (degeneracy) pressure is found to be of order 10~ 26 p, and could 
be neglected. It is, however, the formula (13*27) which should be corrected by this small 
amount (to allow for the cosmical curvature); and the degeneracy pressure in the Einstein 
universe is strictly zero. 



14-2] The Cosmical Problem 259 

connection between waves and curvature, was in four dimensions. Con- 
sequently the four-dimensional volume element is changed in the ratio j8 3 
in this section, whereas it was changed in the ratio jS 4 in 13'7. Another 
example of this factor occurs in the two modes of reckoning magnetic energy 
(12-85) and (12-86), according as the field is strong enough to render it 
necessary to treat the problem in four dimensions or weak enough to be 
treated as a perturbing effect in a three-dimensional problem. 

14-2. Analysis into Particles. 

In the foregoing calculation we have treated the uniform density p of an 
Einstein universe as though it were the probability distribution of a single 
particle of mass-constant m. We shall now suppose that the space of radius 
B is occupied by N' elementary scalar particles. By the exclusion principle 
these will be represented by orthogonal wave functions, each particle having 
a separate wave function. To obtain the ground state of the distribution 
we must select the wave functions of lowest energy. 

The wave functions can be classified similarly to those of an atom, except 
that we are not troubled by the duplication of states due to opposite direc- 
tions of spin. (That is taken care of automatically when the N r scalar 
particles are replaced by 4JV' elementary vector particles.) In the atom the 
waves are concentrated into small volume by a controlling Coulomb field. 
Here the gauge factor /? plays the part of a concentrating force; or, what 
comes to the same thing, the waves spread to the natural limit imposed by 
the finitude of space. In the stereographic projection of the uniform sphere 
we have a concentration of density towards the origin and a thinning out at 
great distances, just as in the atom; we may look on it as an illusion of 
projection, but mathematically the effect is the same as if the distribution 
were being held together by a central force. 

The lowest state (K state) is the projection of a uniform^ spherical dis- 
tribution; and the investigation of 14- 1 is directly applicable to it. We need 
not treat in detail the successively higher states, which will be projections of 
spherical harmonics in four dimensions. We proceed at once to the highest 
of the N 1 occupied states, which we shall call the limit state. We denote the 
energy of the limit state by w 2 . For the actual universe the limit state 
corresponds to very high quantum number (about 10 26 ), so that the energy 
levels have there become practically continuous. For the ordinary problems 
of physics (excluding cosmical problems) there is an inexhaustible supply of 
particles with energies practically equal to the limit energy m 2 . 

The mean energy of a particle is 

m = fw 2 . (14-21) 

This is a general formula applying to all systems of orthogonal functions in 
three dimensions. For the rectangular waves, treated in the box problem, 

17-2 



260 Physical Applications [14*2 

it is contained in the formula S 2 = fr 2 (13-26); it there follows from the fact 
that the elementary wave functions form a lattice of uniform density in 
to-space. It is well known that the density of the lattice remains invariant 
when other orthogonal functions are used instead of Fourier functions, and 
when the controlling field of potential is varied; or, as it is usually expressed, 
each eigenfunction occupies a cell of phase space of volume A 3 . The only 
addition we need make here is that, since p = 3P for standing waves, the 
ratio between the mean contribution and the limit contribution to the 
pressure applies also to the energy density. 

In treating the energy levels in an atom, recourse is had to a device known 
as the selj '-consistent field. The electrons are put one by one, not into the field 
of the nucleus alone, but into the field provided by the averaged distribution 
of the whole system. We use the same principle in analysing the universe 
into particles. The whole distribution determines the radius R of the space, 
into which we put the particles one by one; and the controlling field j8 9 
which depends on R, is likewise determined by the complete distribution. 

Let us glance at the alternative of building up the Einstein universe 
synthetically. The first particle takes up a distribution of self-equilibrium 
a miniature Einstein universe consisting of one particle. We can calculate 
its energy by (14-162), since in this case Jf' = m'. But the calculation is of 
little value, since the K state in the completed system will have an altogether 
different energy. Each particle that is added, not only contributes its own 
energy, but modifies the energies of all the preceding particles. 

By the use of the self-consistent field the energies of the particles are 
defined in such a way that they are precisely additive. This is important in 
connection with the question, whether the whole density of the distribution 
is represented by X ift k <f> k or by (S $ k ) (S </> k ), $ k and </> k being elementary wave 
functions. In ordinary applications, such as the problem of the rectangular 
block, it does not matter; because, owing to the orthogonality of the wave 
functions, the cross terms yield zero integrals over the small volume that is 
being considered. But when the waves extend over the whole universe, the 
vanishing of the integral over the whole universe is not a sufficient reason for 
neglecting the cross terms locally. When we use the self-consistent field the 
density is fc < fc . For the wave functions, and the energy derived from them, 
are calculated on the supposition that all the particles are present and 
producing the field j8. The cross-energy with all the other particles, as well 
as the self-energy of the kth particle, is already incorporated in $ k <f> k \ and 
we do not require additional terms ift k <f> k > to represent it. 

This is the same outlook as in 12*6, where the mutual energy invariant 
of two systems is replaced by self-energy invariants. In fact the energy 
attributed to a particle must necessarily originate as a mutual energy of some 
kind. It is only when the distribution of the other particles differs from the 



14-3] The Cosmical Probkm 261 

standard distribution used in calculating the supposed self-energy, that the 
mutual energy terms have to be recognised explicitly .f 

A precaution is necessary is using the self-consistent field, if we are to 
avoid counting the energy twice over. In a field of fixed potential < the 
energy of a particle is reckoned as m^\ but in a field produced by other 
particles the energy is reckoned as Jw^, the other half being allotted to the 
particles producing the field. 

We may summarise the progress of ideas as follows. The mechanical 
properties of macroscopic matter forming an Einstein universe are usually 
represented by curvature of space-time; but we can, if we prefer, analyse 
it into particles and embody the mechanical properties in wave functions. 
Energy and pressure represented by wave functions must be omitted from 
the curvature (11-7); thus the complete wave representation consists of 
wave functions in flat space-time. We therefore consider an alternative 
representation in flat space-time obtained by stereographic projection of 
the distribution. In this representation there is a concentration towards the 
origin, as though a controlling force prevented the matter from spreading 
away to infinity. We analyse this density distribution into orthogonal wave 
functions, representing particles which obey the exclusion principle. All the 
particles must be fully present in the distribution, up to a limiting energy 
determined by the number of particles; otherwise there would be a conflict 
between the relativity conditions and the wave mechanical conditions for 
a static or ground state. Conceptually each wave function or state could be 
occupied or not; but actually the decision that the states shall all be occupied 
was taken at the beginning when we chose a radiationless static universe 
for projection. Thus the disturbance of a particle by the particles occupying 
other states has been taken into account from the beginning. The test of 
consistency is that the sum of the densities corresponding to each state shall 
reproduce the density originally postulated. 

14-3. Threshold Energy. 

In elementary quantum theory the system under discussion is always treated 
as an independent addition to the rest of the universe. If the quantum phy- 
sicist ever remembers that there is a "rest of the universe ", he treats it as an 
ideal background which will not interfere with his system. He does not 
picture it as consisting of N' other particles competing with his own particles 
for the states of lowest energy; nor does he contemplate the possibility of his 
own particles dropping into vacant levels in the background. The back- 
ground is treated as impermeable. 

This is equivalent to assuming that the "rest of the universe" is in the 

t Thus perturbation theory will involve (2ty ft ) (S# A ) since it deals with deviations from the 
distribution which furnishes the adopted self -consistent field. 



262 Physical Applications [14-3 

ground state; so that there are no excited particles to drop into the vacant 
levels in the added system, and no vacancies for the added particles to fall 
into. 

When another (scalar) particle is added to the system of N' particles in 
the ground state, the lowest vacant energy level is at the limit energy m 2 . 
The added particle must be endowed with an energy w 2 in addition to any 
transferable energy it may possess. Thus w 2 , which is the limit energy of the 
background particles, is the threshold energy of the added system. There is 
thus no interference between the added particles and the rest of the universe. 
The energy levels in the added system begin at w 2 ; the particles forming the 
rest of the universe are all placed below this threshold level. 

The threshold energy w 2 is the rest energy or proper mass of an added particle. 
This applies to scalar particles; it is modified in accordance with the theory 
of Chapter xn when we substitute charged particles. The added particle 
may be excited above the threshold level, and so possess additional (kinetic) 
energy. The transferable energy of a particle is really its energy of excitation 
in the universe-atom; though it is, of course, not usually regarded that way. 

Although the elementary equations of quantum theory postulate that the 
particles forming the rest of the universe are in the ground state, that does 
not mean that the theory is only applicable if the universe is an Einstein 
universe. Of necessity an elementary equation refers to idealised con- 
ditions. The ground state is the fixed standard background; and any devia- 
tion from the ground state must be explicitly described as an addition to 
the fixed background, and as such taken account of as part of the added 
system. Usually these additions are called "fields". 

For example, let the added system consist of particles in thermodynamical 
equilibrium at temperature T. Clearly there must be statistical equilibrium 
between the added particles and the background particles, so that the back- 
ground particles will be excited. But the excited background is not treated 
as such. It is described as a fixed impermeable background plus a field of 
radiation. Exchanges of energy between the added particles and the 
excited background are described as exchanges between the added particles 
and a field of radiation. 

Radiation is the name under which vacancies in the sub-threshold levels 
are taken into account in current theory. I can see no possible doubt about 
this identification. For in problems of this type radiation is the only entity, 
besides the particles, that is mentioned as an addition to the fixed back- 
ground. So that, if the vacancies are not taken into account as radiation, 
they are not taken into account at all which would clearly be a gross error 
rendering agreement with observation impossible. 

I think that this is the natural approach to a quantum-relativistic theory 
of radiation. But as radiation is rather apart from the main subject of these 



14-3] The Cosmical Probkm 263 

investigations, I have not found time to develop the theory. I do not suppose 
that progress will be easy; for it is a long step from radiation in an Einstein 
universe to radiation in the concentrated systems where it is studied in 
practice. That radiation is the positive aspect of the vacancies caused by 
excitation of the unspecified (scalar) particles seems to me indubitable; but 
what I now add is an unchecked first impression. 

Presumably a single vacant level constitutes a photon. The energy of the 
photon is that which is released when the photon is abolished by a limit 
particle falling into the vacancy, i.e. the depth of the vacant level below the 
limit energy. Since the. energy levels are practically continuous near w 2 , a 
photon can have practically any energy up to w 2 . I doubt if ra 2 is a genuine 
upper limit; further investigation would be required to ascertain whether it 
still applies in less idealised conditions. 

In non-equilibrium conditions the added particles may be localised as 
wave packets-, similarly the vacancies may be localised as voave pockets. The 
wave pocket seems to be nothing more than the partial localisation of a 
photon, manifested, for example, in the observation of individual X-ray 
effects in an expansion chamber. 

The particles which should have occupied the vacant levels will exist at 
or above the limit level. (A vacancy, in the sense of annihilation of a particle, 
is not a photon but an impossibility.) If a photon forming part of a field of 
radiation is absorbed by an atom, we can picture it as continuing its exist- 
ence in the same form inside the atom, namely, as a vacancy at one of the 
lower levels in the atom. In such exchanges the energy of the photon, 
measured by the depth of the vacancy below the limit up to which the levels 
are occupied, does not remain constant, since the ejected particle may carry 
off some of the energy. 

Leaving the subject of radiation,f we return to the added particles. The 
added system is formed by specifying certain particles either by wave 
functions or macroscopically. The particles to be specified must always be 
taken from the uppermost level, so as to leave no hole in the background. 
Since the levels near m 2 are exceedingly close together, there is for ordinary 
purposes a practically unlimited supply of particles of rest mass m a ; and the 
specified particles will have this proper mass. But when we extend the 
formulae to cosmical systems, we must ultimately come to the deeper 
strata. Finally, when we apply our calculations to the whole Einstein 
universe, and specify the whole of the N 1 particles (macroscopically), the 
total rest energy instead of being N'm 2 is 

M=N'ffi=%N'mt (14-31) 

by (14-21). 

t For a deduction that photons (as here identified) obey Einstein-Bose statistics, see 
16-3. 



264 Physical Applications [14-3 

The deficit f N'm% is the (negative) gravitational potential energy. 

Gravitational energy is therefore the exclusion effect treated in 14-2, 
but viewed from the highest level instead of from the zero level. Remem- 
bering that the unspecified particles have to form an impermeable system, 
each successive particle that is transferred to the specified system must be 
taken from a slightly deeper level. The rest energy of n particles is slightly 
less than n times the rest energy of one particle. When we come across this 
phenomenon observationally we attribute it to gravitational energy. We 
have here considered only the simple case of an Einstein universe, and have 
not entered into the complications due to concentrating the specified 
particles. That is scarcely necessary; for we are not putting forward a new 
"explanation" of gravitational energy. The relativity representation of a 
gravitational field as curvature of space-time was the starting point of the 
whole investigation. What we have now reached is the same phenomenon 
viewed from the topsy-turvy outlook of elementary quantum theory. 

Exclusion energy, interchange energy, potential energy are different ways 
of regarding the same thing. Up to the present we have been concerned only 
with gravitational potential energy; but in Chapter xv we shall find that 
electrical potential energy has the same origin. 

There is reason to think that our expanding universe is rather far removed 
from the Einstein state and that a de Sitter universe would be a better 
approximation. It may be suggested that this, whilst not invalidating the 
use of an Einstein universe as a standard background, may cause it to be 
inconvenient in practice. But the local conditions of our experiments always 
differ much more widely from those of an Einstein or a de Sitter universe 
than these do from one another. The density in an Einstein universe is 
3-32. 10~ 27 gm. cm.~ 3 The use of an Einstein universe as the standard con- 
dition, to which the exact equations refer, means that gravitation corre- 
sponding to this density has been allowed for in the equations. The actual 
field in any practical problem is always very much greater, and it is unim- 
portant whether the standard field is deducted or not. 

14-4. Positrons and Negatrons. 

The scalar wave functions, which we have been considering, can each be 
analysed into four vector wave functions representing protons and electrons. 
We may therefore have at any level a vacancy due to the absence of a proton 
or electron, instead of the absence of the whole scalar particle. We have seen 
that the absence of a scalar particle is represented as the addition of a 
photon. According to Dirac the absence of an electron is represented as the 
addition of a positron. 

I think that the present investigation throws new light on the meaning 
of the vast number of occupied negative energy levels, which Dirac postu- 



14-4] The Cosmical Problem 265 

lated in his theory of the positron. The levels are described as negative, 
because energy is now reckoned from the threshold energy as zero.f This is 
in accordance with the outlook of elementary quantum theory described 
in 14*3; the systems which it treats are additions to the fixed background, 
and the zero of energy is therefore transferred to the threshold level at which 
the additions start. Of these negative energy states, Dirac saysf 

These assumptions require there to be a distribution of electrons of infinite 
density everywhere in the world. A perfect vacuum is a region where all the 
states of positive energy are unoccupied and all those of negative energy are 

occupied The infinite distribution of negative -energy electrons doea not 

contribute to the electric field. 

We have reached an altogether different view of the way in which the 
negative energy levels are filled. They are occupied by the large amount of 
matter of the universe not specifically mentioned in the equations which 
Dirac was developing and applying. This matter is ignored, i.e. treated as a 
fixed background, in the idealised equations; and such treatment auto- 
matically relegates it to the levels below the threshold which it must fill 
completely. Of course, in the actual universe this is far from true; but the 
idealised equations are a far from complete representation of the conditions, 
and must be supplemented by terms representing fields of radiation, etc. 
The chief points on which we go beyond Dirac's theory of the positron are : 

(1) Our negative energy levels are occupied by protons and electrons 
equally. Dirac supposes them to be occupied by electrons only, with the 
result that there is an infinite negative charge to be suppressed in an 
arbitrary way. 

(2) Our theory places protons and electrons on the same footing, and 
therefore definitely predicts the existence of negatrons or negative protons, 
unless there is some unforeseen limit ( < 2m p ) to the amount of energy that 
can be expended in a single process of excitation. 

(3) The number of negative energy levels is not infinite; it is known 
precisely ( 14-7, 16-8). 

When an electron annihilates a positron, the sum of their energies 
( ^ 2rn e ) is emitted as radiation; In Dirac 's theory it is assumed that this 
is equal to the difference of energy level, as though the falling of the 
electron into the vacant level were the same kind of transition as that 
which gives a spectral line of an atom. But in the annihilation of an 
electron and positron in free space, two photons are emitted. The process 

t This, however, is not the negative energy to which Dirac refers. A positron has rest 
energy m e , and he therefore supposes that the particle whose absence it represents would 
have had rest energy m e . As explained at the end of this section, we do not accept 
this conclusion. 

J Quantum Mechanics, 2nd ed., p. 271. 



266 Physical Applications [14-4 

is therefore not comparable with an ordinary transition in which the whole 
energy is emitted as one photon. 

In our theory the depth of the vacant level below the zero level gives 
only the excess of the energy of the positron above its rest energy m e . 
A vacancy just below the limit level corresponds to a positron almost at 
rest. We have seen ( 6-7) that owing to the idempotency of the stream 
vector of an elementary particle, the vector J representing charge, current, 
etc., is often confused with the vector J 2 representing energy, momentum, 
etc. The absence of an electron whose stream vector would have been J 
is equivalent to the presence of a positron with stream vector J\ the 
reversal of the charge and current is thus duly indicated. But the energy 
and momentum of the electron and positron are J 2 and ( 7) 2 , so that 
they are the same; in particular, the quarterspur which represents the 
proper mass is the same. Our later developments of the theory of the 
origin of mass have thrown some more light on this point. We recall that 
energy is furnished to a system by specifying it (11-6), and that the 
energies m p9 m e are additions made at one stage of the specification, 
viz. when a neutral particle is specified as a particle with definite charge 
and spin. So far as this addition is concerned, it makes no difference 
whether the original neutral particle had a positive or a negative existence, 

14*5. Masses of the Added Particles. 

It is useful to distinguish two systems of application of wave mechanics: 

A. Cosmical System. This treats the specified and unspecified particles 
as one great system of N' particles. Owing to the exclusion principle the 
particles occupy various energy levels; and in the ground state their energies 
extend up to a limit m 2 . 

B. Local System. The zero of energy is moved up to the level m 2 , and the 
local system is an addition above that level. On the negative side of the new 
zero there remains the completely filled set of levels (now negative) which is 
never disturbed. This is variously regarded as a comparison fluid, an im- 
permeable background, a pure inertial field, or a static spherical space, on 
or in which to erect system B. 

A disturbance of system A from its ground state necessarily involves 
defect below the limit level as well as excess above it. But nevertheless in 
system B the levels are always assumed to be completely filled; and the 
defect as well as the excess is represented as an addition above the new zero. 
Suitable entities, such as radiation, positrons, electromagnetic potential, 
are introduced to represent the defects when considered as additions. In 
Newtonian theory an irregular gravitational field is also regarded as an 
addition to the fixed background. If we follow Einstein's theory, it is not an 
addition to but an abandonment of the fixed background. A space of variable, 



14-5] The Cosmical Problem 267 

and in general non-static, metric is substituted for the uniform static 
background. System B is then inapplicable and it is necessary to revert to 
system A. 

We shall now consider more closely how the electrons and protons of the 
local system are created as an addition to the fixed background. We take 
a neutral particle (a quarter of a scalar particle of system A) at the limit 
energy, and "specify" it. That is to say, we assign to it wave functions 
giving its probability distribution as modified by incorporating observa- 
tional information as to its charge, spin, momentum, position, etc. As a 
neutral particle it was equally likely to be a proton or an electron; we now 
specify it definitely as an electron, say. This involves the use of vector wave 
functions. The vector wave functions also make the particle mobile in the 
ordinary sense, i.e. characterised by a stream vector to which Lorentz 
transformations are applicable. The scalar wave functions can only express 
mobility of the type represented by standing waves. 

Let us now regard spherical space as the frame, the original probability 
distribution of the neutral particle as the partial comparison fluid, and the 
specified distribution as the object particle. The observable relations are 
contained in the double wave vectors T, X of the combined system. It is 
in these that the observational information is directly incorporated; for 
example, our observations create wave packets in *F, X. Current wave 
mechanics replaces these by simple wave vectors ^, x for the object particle 
and scalars ^, o> for the comparison fluid. As shown in Chapter xn this 

imposes the relation a ,,,*,, v 

* 10m 2 -136ww + m 2 =0 (14-51) 

between the mass m of a proton or electron in the local system and the mass 
m of the neutral particle. 

Remembering that the addition of two systems is represented by multi- 
plication of their wave functions, we notice that whereas V describes a 
modification of the distribution, $ describes what has been added by the 
modification. The reduction from T to is really a casting out of the original 
background particle. If we define an electron or proton to be the entity re- 
presented by ^r, it is a pure addition to the background. Accordingly the 
simple wave vectors are the appropriate representation in system 5. This 
may be more easily seen if we use energy invariants as in 1 2- 6. By specifying 
with double wave functions a neutral particle of system A which had an 
energy invariant w 2 , we obtain a particle, also in system A, with energy 
invariant 136wm . This is depicted in system B as having an energy in- 
variant 10m 2 . By (14-51) there is left an amount m 2 to provide for the 
original particle which remains in the background. Thus the procedure of 
specification, besides providing a particle in system , stops up the hole in 
the background. 



268 Physical Applications [14-5 

Our result that the rest mass of a particle is to be identified with the 
threshold energy w 2 applies to scalar particles. The mass becomes multiplied 
by i(w p + ra c )/w = 136/20 when the scalar particles are replaced, as they 
eventually must be, by protons and electrons. We may regard the factor 
136/20 as a transformation of the scale of reckoning in passing from system 
A to system B. 

We can now see the complete connection between the masses of the 
elementary particles in quantum theory and the conception of mass as 
curvature in relativity theory. In quantum theory piass corresponds to the 
periodicity of the waves. The only direct connection of periodicity and 
curvature is the periodicity which arises from "going round the world", 
the corresponding wave length being 2?rJ?. The corresponding mass in c.o.s. 
units is A/27TJSc. At first sight this seems far too small to be important. But 
we have to remember that there can be only one scalar particle to which this 
applies; and the exclusion principle forces succeeding particles to have 
periodicities corresponding to the higher harmonics. The result is that an 
average particle has a mass comparable with that of an electron or proton. 
But there is still the factor 136/20 to be applied before we reach the masses 
ordinarily recognised. It is not necessary to go over again the explanation 
of its occurrence in Chapter xu. We may note, however, that since the rest 
mass is not an intrinsic attribute of the particle but represents the energy 
of the particle in an assembly in statistical equilibrium, it depends on the 
number of degrees of freedom which share in the equipartition of energy. 
The factor is really a compensation for adopting (in current theory) a 
simplified representation in which the recognised number of degrees of 
freedom is greatly reduced by the substitution of simple for double wave 
functions. 

14-6. The Standard Mass m . 

It is perhaps necessary to remind ourselves that we are not putting forward 
a new theory of natural phenomena; we are comparing two mathematical 
methods of treating the same system. Primarily we are treating an Einstein 
universe, i.e. a uniform static distribution of matter extending indefinitely; 
but in order to compare our scale of measurement with that used in quantum 
theory we insert in it a microscopic "added system". From the ordinary 
point of view (system B) the added system is the centre of attention; the 
equations used to describe it in elementary quantum theory take it for 
granted that its surroundings are uniform and static, as here supposed. The 
surroundings are normally represented by a metrical field g^, which in the 
uniform conditions postulated corresponds to a spherical space of radius It. 
The value of B is involved because, by the macroscopic theory, the material 
standard of length used in actual experiments on the added system takes up 



14-6] The Cosmical Problem 269 

an extension which is in a definite ratio to the radius of curvature of the 
corresponding three-dimensional section of space-time.f Thus we have to 
treat an added system represented by waves in surroundings represented by 
curvature. We now introduce a mathematical transformation of the pro- 
blem, which represents them both together by waves (system A). We first 
represent the uniform surroundings, or Einstein universe, by waves. The 
added system is then formed primarily by a modification of these waves 
a specification by double wave functions. But the modification can also be 
represented by the original waves together with additional waves; and the 
additional waves form the usual description of the added system. 

It is important to realise that the physical system studied in this 
chapter is precisely the physical system which a quantum physicist treats 
at the beginning of his subject. We are treating an Einstein universe; 
that is all except that we forgot to mention that one of its particles must be 
excited (one will be enough) in order that the natural constants h, m e , etc. 
may appear in the problem. The quantum physicist takes as his most 
elementary problem one particle in free space ; that is all except that he 
forgot to mention that there are some 10 79 other particles present in an 
equilibrium state. (For he believes that his formulae can be experimentally 
verified without destroying all the particles in the universe except one.) 
Our problems are identical; but our respective forms of absent-mindedness 
show that we view them from a different outlook. The two outlooks 
correspond to system A and system B. 

We must notice one point in the transformation which will be important 
in the numerical calculations about to be made. The limit state of system A 
becomes the K state of system B. We define the K state as that in which a 
particle has uniform probability distribution over spherical space. The 
threshold energy m z is that of a particle definitely at rest in system B and 
therefore having entirely uncertain position. The added particles in system 
B do not exclude one another, except for the slight negative exclusion effect 
represented by gravitational potential energy; thus, except in so far as the 
position is specified, any reasonable number of them can occupy the K state 
of uniform distribution. This change of K state occurs when we pass from 
the universe with p = 3P to the universe with P = the change which 
introduces the factor (14-153). In abolishing the pressure, the energy 
previously represented by standing waves is replaced by curvature. We 
must not duplicate the representation by waves and curvature. Energy up 
to the threshold energy being represented by curvature, only the excess 
remains to be represented by waves. Thus in system B a particle at rest is 
represented by a uniform distribution of probability, instead of by standing 
waves corresponding to a spherical harmonic of high order. 
t Maihematioal Theory of Relativity, 66. 



270 Physical Applications [14-6 

The connecting link between systems A and B is the mass ra of the 
neutral particle. In system B it has been defined as the mass belonging to a 
simple (progressive) wave function which possesses only an algebraic phase; 
equation (14-51) was derived as the necessary relation between the mass w 
of such a wave function and the mass m ( = m p or m e ) of a vector wave func- 
tion both referred to the geometrical frame. In system A our elementary 
scalar wave functions have mass w 2 ; but the waves are of a somewhat 
different character, being standing waves in three dimensions instead of 
progressive waves in one dimension. It would not have been possible to 
amalgamate progressive waves in three dimensions without introducing 
matrix phases. 

We already know that the scalar particle mi corresponds to four neutral 
particles ra , which on specification yield two protons and two electrons. 
There is an apparent discrepancy between the four particles and the three 
dimensions of the standing waves. This is rectified by the factor f found in 
(14-153). It is true that W = |w 2 ; but in ordinary reckoning w a is replaced 
1* <-*.. "that ^^ (14<61) 

It is convenient to get rid of the factor f by introducing an intermediate 
system A', which is the same as system A except that all masses are multi- 
plied by $. 

Similarly we introduce an intermediate system B' consisting of neutral 
particles with one-phase wave functions. The multiplication of the mass by 
136/20 then occurs in passing from system B' to system JB. 

We have then four systems defined as follows. System A is the direct result 
of the analysis of the curvature into elementary scalar wave functions 
constituting standing waves in three dimensions. In system A' we convert 
the Einstein universe with p = 3P into an Einstein universe with P=0, 
thereby increasing all energies in the ratio f ; this increase is the same as if 
the energy had been calculated for standing waves in four dimensions instead 
of three, and we shall regard it in that way. In system B' some of the limit 
particles of system A' with energy ra 2 ' are taken as the basis of an added 
system; the scalar particle with (effectively) four degrees of freedom is for 
this purpose divided into four neutral particles with one-phase wave 
functions and energy w = |w 2 '. In system B the ordinary vector wave 
functions are introduced and the total mass becomes multiplied by 136/20. 

In system JB' the coefficient m in the energy operator m-*8*l8xp8x a can 
be identified with m . This follows from the elementary formula for the 
plane waves of a neutral particle 



m = i d/ds = mo" 1 
This must also apply to system A 9 . But in system A the unit of mass is 



14-6] The Cosmical PrdbUm 271 

altered so that m=|m . Thus (14*144), which refers to system A, becomes 

P-l/JIXF, p-S/JZXF. (14-621) 

Hence the corresponding total mass in system A' is 

JT-fJlf-ipF-4/lZX, (14-622) 

which is the more explicit form of (14-162). This is the expression for the 
energy of a scalar particle forming a uniform distribution in spherical space 
of radius jR, i.e. it is the energy of the K state. We shall therefore change the 
notation M' to m K . It is convenient at this stage to insert the factor A/TT 
required to reduce masses to c.o.s. units.f We therefore write (14-622) as 



* m Q R 2 \7rf 

As we shall not further use system A, we shall omit the accents previously 
used to distinguish quantities referred to system A'. 

Consider first a universe which consists of one scalar particle. There can 
be no absolute comparison of standards in different universes; we shall 
therefore arbitrarily define the corresponding units to be such that m and 
R are the same as in the actual universe. The masses m 2 , m p , m e which are 
numerically connected with m will also be the same. But the natural con- 
stants K, h may be different. We denote their values in the one-particle 
universe by /c x , h l . 

In the one-particle universe, the K energy, the limit energy, and the total 
energy coincide, so that m K =m 2 = M. Hence, by (14-63), 

(14-641) 



so that Wo^/TrJZ. (14-642) 

Returning to the actual universe, let 

P = ? S N'. 

Then by (14-31) w a = M/p. That is to say, the wave function representing a 
single scalar particle uniformly distributed over the sphere J will have only 
1/pth of the energy required to produce the curvature 1/-B. In (14-641) a 
single particle produced the curvature 1/12, and the coefficient mr l of the 
energy operator was then m^ 1 (At/Tr) 2 ; we must therefore take the co- 
efficient to be 1/pth of this, namely m Q " 1 (A/w) 2 , where 

tf^p-ihi*. (14-66) 

From (14-642) and (14-65) 

(14-66) 



f We still omit the constant c. For the factor h/ir, see 9-6. 

j As explained earlier in this section the threshold energy w a corresponds to uniform 
probability distribution over the sphere in system B. System B corresponds to the ordinary 
equations of elementary quantum theory which define the constant h. 



272 Physical Applications [14-6 

We may note also that ic 1 =^/c, since the well-known condition for an 
Einstein universe, applied firstly to the one-particle and secondly to the 
tf'-particle universe, gives Kim ^^ R = Kpm ^ (14-67) 

Thus K/h* is independent of N'. 

14*7. Numerical Solution. 

Since each scalar particle corresponds to four elementary particles, the 
total number of protons and electrons is N = N f . Inserting the constant c 
to reduce to C.G.S. units, (14-66) becomes 

m Q = hV%N/27rRc. (14-71) 

We have also, by the ordinary relativity formula for an Einstein universe, 

\<nE = >cJ//c 2 = \KN (m p + m e )/c* (14-72) 

together with m p + m e = ( 136/10) m (14-73) 

by (14-51). 

From (14-71) and (14-72) 

N 7TC 2 



whence V^= _ = _ - _ Q4-74) 

* ~2 K m Q (m p + m e ) 10 2 K (m p + w c ) 2 v } 

by (14-73). We shall find in Chapter xv that Ac/27re 2 = 137; so that the result 
can also be written as 



10 ic( 

Formula (14-75) is the most suitable for an accurate determination of N 
from observation. As suggested by W. N . Bond the so-called ' ' observational ' ' 
values of e/m e are (from the point of view of wave mechanics) erroneous by 
a factor ^|f ; but the "observational" values of m p /m e are also in error by 
the same factor, so that the observational values for ejm p can be accepted as 
correct. I adopt the values given by W. N. Bondf from recent deter- 
minations: 

e//w e c= 1-7574. 10 7 (uncorrected); 1-7703. 10 7 (corrected), 
so that e/m p c= 1-7703. 10 7 ~ 1847-6 = 9582. 

For K the generally accepted value appears to be 6-664.10~ 8 , but it is 
uncertain to 1 part in 1000. Inserting these values in (14-75), we obtain 

JV P = 3-1454.10 79 . 

Following a suggestion by R. Fiirth that the core of the large number N 
is 2 256 , we can express the number %N of electrons or protons as 

#= 135-82. 2 256 . 
t Nature, 135, 825 (1935). 



14-8] The Cosmical Problem 273 

Since N is an integer by definition, no irrational factors can enter into its 
composition; and the possible hypotheses as to its constitution are very 
limited. In view of the close association of the numbers 136 and 256 in the 
theory of the double wave function, our result makes it probable that the 
exact value is JV = 136. 2 256 . In Chapter xvi we shall obtain this number 
independently from theory. 

Conversely, assuming JJV = 136 . 2 266 , and using the observational values of 
e/m p and c, we can calculate K. The result obtained by Bondf is 

* = (6-659 -0012). 1C- 8 . 

If the ratio of the electrical to the gravitational force between a proton and 
electron (calculated according to classical theory) is denoted by F, we have 

F== & = 1362 g2 

~ *cw, n w ~ 10 *c(w 



| QO _ 

Hence ( 14-75) gives F = - VN. ( 14-76) 



14-8. Alternative Treatment. 

If the scalar wave functions ^r, <f> contain a time factor e ikt , the particle 
density a contains a time factor e 2?7f/ . The volume is therefore expanding as 
e -2iM } an( j fa e linear scale as e~$ ikt . We can therefore connect the represent- 
ation of mass by waves with the theory of the expanding universe. 

Consider an expanding spherical universe whose radius at time t is 
R = K Q ef. The line element is J . 



x 9 y, z being the stereographic coordinates at time t. The pressure and 
density are found to be 



To represent the above imaginary expansion of linear scale, we must take 

(14-825) 



so that (14-82) becomes 

^P=-^ 2 -* 2 

V ' 



), J 
which is a generalisation of (14*12). 

t Nature, 137, 317 (1936). 

% Tolman, Relativity, Thermodynamics and Cosmology, equations (150-2), (150-7), (150-8). 

ETP 18 



274 Physical Applications [14*8 

As in 14-1, we replace the curvature by standing waves in flat space, so 
that p = 3P. Inserting this condition in (14-83) we have 

f& 2 + A = f JB-*= Sine/), (14-84) 

which is a generalisation of (14-151). 

The meaning of (14-84) is that, by adopting a slightly lower value of the 
arbitrary constant A, we can reserve a portion of Sirup to be represented by 
wave functions with a time factor e ikt . The reservation of a portion of the 
whole energy tensor or matrix for representation by waves has been dis- 
cussed in 11-7. Let l/pth of the energy tensor be thus reserved, so that 

|Jfc 2 = 3/2jpJZ 2 . (14-85) 

There are two ways of viewing (14-84). The macroscopic view is that 

A = SlTKip, %k 2 = &irK 2 p, 

so that the energy of the wave is provided by a general weakening of the 
gravitation constant involving a decrease of the gravitational mass of every 
particle in the distribution. But in that case the factor e iki belongs to a 
collective wave function of the whole particle density, and not to an in- 
dividual particle. The alternative view is that 

A = SlTKp! , ^fc 2 = &7TKp2 . 

In particular, if p = f N', the reserved density p 2 amounts to just one scalar 
particle in the K state. This corresponds to the elementary conception of 
a single particle represented by waves, with an impermeable background 
represented macroscopically. 

The mass corresponding to a time factor e ikt is 



In the case of a collective wave function, every particle has this amount of 
mass represented by waves; for its own particle density (equal to the 
product of its individual wave functions) changes with the expansion of R 
in the same proportion as the whole particle density. The total mass repre- 
sented by the waves is therefore hpk/7T. f This mass is now replaced by a single 
reserved particle whose individual wave function must accordingly be given 
a time factor e ipki . In ordinary reckoning (system A') the mass of the 
particle is multiplied by f , and becomes 



Hence, by (14-85), m ' 2= * }' (14-86) 

This result must, however, be divided by 2, because the energy density p 2 
here attributed to the particle is a mutual energy of the particle and the rest 

t That is to say, if the individual wave functions are of index 1, the collective wave 
function is of index l/p. 



14-8] The Cosmical Problem 275 

of the system; so that if we sum it for all the particles we obtain twice the 
total energy. It will be seen from (14*84) that if we reserve in succession the 
p equal amounts of density /t> 2 , the unreserved portion of -R~ 2 (which is left 
to be represented in system A' as curvature) diminishes by equal steps, so 
that m' 2 decreases regularly from (14-86) to zero. The corrected result 
(giving the energy reckoned according to the method of the self-consistent 
field) is therefore 

(14 ' 87) 



The mass m f here found is to be identified with m . It is perhaps rather 
difficult to see that it is ra rather than w a . But it is to be remembered that 
the mass constant both for the scalar particle and the neutral particle is ra ; 
and the fourfold energy of the former comes from its additional degrees of 
freedom. We are here treating the progressive waves in one dimension t\ and 
(14-87) applies to one particle with one degree of freedom. If we represented 
the four neutral particles by separate wave functions, each of them would 
have the time factor e ipki and the corresponding energy; just as, when earlier 
we divided the collective wave function into p individual wave functions, 
each of them had the energy corresponding to the time factor. 

The result (14-87) is accordingly an alternative derivation of (14-66); and 
the solution in 14-7 then follows. 

If we are not seeking the exact result, it is comparatively easy to obtain 
a solution of the cosmical problem sufficient to confirm the order of magni- 
tude of the recession of the spiral nebulae. 

Consider an object system and N unspecified particles. Observational 
determinations of the position of the object system must be relative to a 
physical frame of reference provided by the unspecified particles. Since 
they are unspecified, they have random positions in the hypersphere of 
radius R which constitutes space. The mean square deviation of a particle 
from the centre of the sphere is %R in each of the four coordinates; so that 
the mean square deviation of their centroid is ^R^N. Thus the physical 
frame of reference provided by the unspecified particles is such that its 
origin has a standard deviation %R/\fN compared with an ideally fixed 
geometrical frame. By the uncertainty principle the associated standard 
deviation of momentum is of order 2h^/N/R y or in mass units 2h\^N/Rc. 

Equivalently, the motion of each particle is a rotation about the centre 
of curvature of space and its wave function has a half-quantum h/2n of 
angular momentum in the plane of rotation, or a linear momentum hfirrR, 
the resultant of N such momenta in random directions is h^/Nf^trR. By 
either method we deduce a mass m^ of order hi/N/Rc as in (14-71), giving 
the energy of the physical reference frame referred to the geometrical 
reference frame. The observed relative motion of a particle referred to the 

l8-2 



276 Physical Applications [14-8 

physical frame is then apportioned between the particle and the frame 
according to the principles worked out in Chapters xi and xn, and the mass 
m p or m e attributed to the particle is given by the fundamental quadratic 
(14-51). 

It may be objected that by this procedure what is really the same com- 
parison mass ra is used over and over again for each particle in the universe. 
Quite so. That is the secret of the unexpectedly large masses attributed to 
the particles in quantum theory unexpected because the constant h which 
connects mass and curvature in quantum theory is at first sight much too large . 

In the exact determination of m in (14-71), several numerical factors, 
which might easily have been overlooked, have been introduced. We add 
some further remarks on these. 

The factor f arises as we have seen from the exclusion energy or gravi- 
tational potential energy of the particles, reckoned negative because the 
maximum exclusion energy or limit energy is our ordinary standard. For 
the present purposes N' excluding particles are equivalent to f N' non- 
excluding particles. The only point that arises is whether the same correc- 
tion should not have been used in (14-72). The answer is that (14-72) is a 
standard equation of relativity theory, here used to express our results in 
terms of the observed constant K. On referring to the derivation of this 
formula in general relativity theory, we find that M is the sum of the masses 
of the particles without deduction of gravitational potential energy (which 
is a non-tensor quantity). As we said at the beginning, our plan has been to 
solve the problem of a static distribution by two methods; (14-71) is the 
result of the solution by wave mechanics, and (14-72) the result of the 
solution by relativity theory. 

The factor f , introduced in passing from system A to system A', is perhaps 
less troublesome than the others, because it appears analytically and we 
are not directly concerned with its physical significance. We have seen, 
however, that it is due to the adaptation of the formulae to static problems 
in which the time dimension has specialised treatment, whereas the more 
elementary formulae are based on four coordinates to which Lorentz 
transformations are applicable. Alternatively we can regard it as due to the 
condition that, by considering a spherical space affixed radius JR, we have 
suppressed all waves in the radial direction. The actual R of the universe 
(referred either to an ideal geometrical frame or to a physical comparison 
standard) is, like all observables, subject to the uncertainty principle, and 
has a probability spread. But it is treated as fixed because, being the sole 
linear characteristic of the universe in its ground state, it is the standard by 
which all other lengths are measured directly or indirectly. This posterior 
fixity, conferred on R by its adoption as standard, conceals its intrinsic 
variability. 



14-8] The Cosmical Problem 277 

As soon as the static condition is relaxed, the variability of R (the local 
radius of curvature) forces itself on our attention; it is duly exhibited in the 
formulae of general relativity theory. In other dimensions, we pass from the 
non-static to the static condition by substituting constant standing waves 
for irregular wave motion. In the R dimension irregular wave motion is 
simply dropped, and is not replaced by standing waves. I do not suggest 
that this is an erroneous treatment; but we see how the fourth degree of 
freedom has come to be suppressed. 

To use four-dimensional waves in the present problem would defeat our 
purpose. The Einstein universe is unstable.! It is in statistical equilibrium 
only on the understanding that transfer of the energy of the three-dimen- 
sional waves into the radial dimension is inhibited. If transfer occurs, we 
have an expanding or contracting universe; presumably this is a state of 
pursuit of equipartition of energy an equipartition which can never be 
attained, since expansion of scale corresponds to a hyperbolic phase, and 
the wave functions in the radial dimension are exponentials instead of sines 
and cosines. 

Clearly it is more than a happy coincidence that the energy is multiplied 
by a factor f in passing from system A to system A' . So that before the 
division of the scalar particle into four elementary particles is actually made, 
a transformation equivalent to the substitution of four for three dimensions 
occurs. Perhaps it is better to express it the converse way; the four particles 
are effectively reduced to three before being packed into the static scalar 
wave function in three dimensions. 

Very difficult questions arise in regard to a factor 2 or 4; and it is only by 
the closest attention to the physical meaning of the analysis that we can 
hope to get this factor right. It is complicated at the outset because an 
erroneous factor 2 occurs in Dirac's theory through a confusion between 
double-valued and single-valued wave functions. It may be noticed that 
in the solution of the box problem (13-27) we take the momentum factor to 
be ife/27r whereas for the apparently similar standing waves here used we 
take it to be iA/7r. But in the box problem is an internal wave function 
describing relative motion as in the theory of the hydrogen atom; whereas 
here the scalar wave functions are referred to a geometrical frame. Further 
subtleties arise in the relation of scalar to neutral particles, in the repre- 
sentation of the particles of system B as an addition to, instead of a modifica- 
tion of, the particles of system A (so that when all the particles are specified 
we duplicate the universe), and in the liability to count the mutual energy of 
the particles twice over. The last correction was required in our second 
derivation of w , but not in the first derivation. I can only offer the solution 
here obtained as the best effort I can make to avoid these pitfalls with the 
t Perhaps the term "metastable" expresses the condition more precisely. 



278 Physical Applications [14-8 

feeling that it will be something of a miracle if I have really escaped them all. 
It is reassuring that the same value of N is obtained by an entirely inde- 
pendent investigation in Chapter xvi though even there a question of a 
factor 2 arises which is not easy to decide. 

Considerable light is thrown on these factors by a comparison with the 
theory of the Stern-Gerlach effect for which the corresponding factors 
can be checked observationally ( 12-8). In particular, the internal wave 
functions with four dynamical coordinates (comparable therefore with 
standing waves in four dimensions) are found to have a magnetic energy 
^ of that of a corresponding external wave function, whereas the single- 
phase wave function which defines m would (by the same treatment) 
have j 1 ^ of the energy. The 4 : 1 ratio is shown more directly in the 
magnetic energy than in the dynamical energy, because the masses assigned 
to the particles are such as to eliminate these factors and validate the 
current dynamical formulae which ignore them. The factor f shown in 
(12-86), which corresponds to the transition from strong to weak magnetic 
fields, is equivalent to that introduced in the present problem in the 
transition from disturbed to static conditions. 

I may add an explanation why it is necessary to treat the universe as 
composed of N f scalar particles rather than of 4JV' neutral particles in the 
main part of the investigation. A neutral particle has an equal probability 
of being positive or negative j^collection of 4JV' neutral particles has there- 
fore a probable charge eVW'. This is a large charge which would give 
a potential throughout the universe of the order 10 4 electrostatic units or 
10 6 volts. We cannot trust to probability to provide a neutral universe; we 
have to build it of units whose charge is definitely zero, i.e. scalar particles. 

It is worth noticing that in (14-74) the numerical factor J.-^f on the right 
has a simple relation to the numerical factor |.^(=|) on the left. The 
numbers 3, 10, 136 are the numbers of symmetrical components in 2-fold, 
4-fold and 16-fold matrices, respectively. I think it might be possible to 
exhibit the factor on the left as arising in the reduction of Dirac wave 
vectors to Pauli vectors (which are adequate for describing a static dis- 
tribution), in the same way that the factor on the right arises in the reduction 
from double wave vectors to simple wave vectors. 

14-9. The Recession of the Nebulae. 

Having found N, we can determine the Einstein radius B of the universe 
by (14-72). The result is 

jR= 1-234 . 10 27 cm. = 400-3 megaparsecs. 
The total mass and density in the Einstein state are 

M = 2-61. 10 55 gm. =* 1-32 . 10 22 x sun's mass, 
p e = 3-32. 10~ 27 gm. = 1 hydrogen atom per 500 cu. cm. 



14-9] The Cosmieal Problem 279 

The limiting speed of recession of distant objects is c/J?\/3 per unit 
distance. We obtain 

Speed of recession = 432 km. per sec. per megaparsec. 

The observational value of the speed of recession of the extra-galactic 
nebulae (usually given in round numbers as 500 km. per sec. per mp.) is in 
as close agreement as could be expected. 

In terms of the cosmical constant A the radius and mass of an Einstein 
universe are ~* 



In the mathematical theory of the expanding universe, it has been usual to 
treat three cases according as the actual mass M is greater than, equal to, 
or less than M e . The present theory leads to a different outlook; M is neces- 
sarily equal to M e . If we pay attention to relativity theory only, we have no 
ground for supposing that any static configuration of the matter of the 
universe can be found; therefore universes possessing no static configura- 
tions, i.e. with Jf> Jf e , have been considered possible. But when we treat 
the universe as a collection of N particles obeying the exclusion principle, 
we see that such a system necessarily possesses a ground state and therefore 
a static configuration; thus the universes without static configurations are 
inadmissible. 

I suppose that the ground state of the universe has hitherto been pictured 
as a highly concentrated "atom" with radius negligible compared to the 
present dimensions of the universe; so that it scarcely came to be considered 
in connection with the Friedman-Lemaitre theory. Our calculation shows 
that, on the contrary, it has a radius of 400 megaparsecs, and a density 
equivalent to 2 hydrogen atoms per litre. 

We thus discard the naive idea that in the beginning there was a cosmical 
constant; and that when the universe was made, the Creator had to decide 
whether the amount of matter created should be greater than, less than, or 
equal to the standard mass M e fixed by it. In the present theory A is of the 
nature of a constant of integration adjusted according to the actual mass of 
the universe. 

According to the Friedman-Lemaitre theory, if the universe started from 
the Einstein state of unstable (or metastable) equilibrium, it might either 
expand or contract. But since the Einstein state is the ground state, con- 
traction seems paradoxical. It would be interesting if a contracting universe 
could be ruled out in this way. But at present I cannot see that contraction 
definitely conflicts with wave mechanics; in comparing the ground state of 
the universe with the ground state of an atom we must bear in mind that 
the former is unstable and the latter stable. On the other hand, it seems 
impossible thai the contraction should continue indefinitely; and, if it 
cannot continue, it should somehow be prevented from starting. Formerly 



280 Physical Applications [14*9 

one supposed that the contraction might come to a natural end through 
quantum complications setting in when the particles became closely packed; 
but we now see that the " quantum complications" are already in full play 
in the Einstein universe. 

Although the relations between the natural constants have here been 
calculated for a special distribution of matter, they must hold for the 
irregular distribution in the actual universe. In determining the constant 
of gravitation experimentally, the physicist is not forbidden to arrange the 
matter in his laboratory in any way that suits the experiment; similarly the 
theoretical physicist is not forbidden to arrange the matter of the universe 
in any way that makes his calculation easier. In either case the value found 
for the constant will apply to all distributions, however widely they may 
differ from those used in the experiment or the calculation. Only we must 
take note that, in rearranging the matter to suit his purpose, the experi- 
menter cannot, and the theorist must not, violate any law of nature. Thus 
we have had to defend our rearrangement of the universe as a static con- 
figuration, by showing that the matter of the universe is necessarily such 
that it possesses a static configuration. 

These arguments would be complicated if we took note of the fact that, 
in addition to matter, the actual universe contains a certain amount of 
radiation. Since the amount is changing, it cannot be regarded as an essen- 
tial feature of the problem. It can, I think, be eliminated in the permissible 
"rearrangement" referred to in the last paragraph; but in any case the 
amount of radiant energy is trivial compared with the whole energy of the 
universe. 



CHAPTER XV 
ELECTRIC CHARGE 

15-1. Interaction . 

In classical physics an interaction between two particles means a difference 
in the behaviour of one due to the presence of the other. In wave mechanics 
we study probability distributions, and determine only probable behaviour. 
Interaction is therefore a difference in the probability of behaviour. The 
probability distribution of electron A , which specifies its chance of occupying 
a particular position or possessing a particular momentum, is modified by 
the presence of electron B. 

There are two ways of treating these changes of probability distribution. 
We represent the actual probability as the product of two factors: (1) the 
initial probability, or " basis of statistics", and (2) a modifying factor which 
incorporates any special information supplied. Analytically, the initial 
probability is the volume of the element of phase space containing the 
configurations considered; and the modifying factor is given by the product 
of the wave functions 0, <j>. The effect of the presence of electron B can be 
introduced either in (1) or (2). If we treat the presence of electron B as 
special information, its interaction is incorporated in the modifying factor. 
Alternatively we may regard it as a normal circumstance that the electron 
A, which we are considering, is one of several present in the region; we then 
incorporate the interaction in the initial probability and adopt a "new 
statistics'" for systems of two or more electrons. 

Both alternatives have been commonly employed in describing the inter- 
action of electrons. It is postulated that they repel one another with a 
Coulomb force. This tends to keep them apart, and thereby modifies the 
probability distribution which would have been attributed to them as 
independent systems whose probabilities combine by simple multiplication. 
It is also postulated that a system of several electrons obeys a new statistics, 
called Fermi-Dirac statistics. 

Both the Coulomb force and the Fermi-Dirac statistics describe an inter- 
action; that is to say, they assign to the electrons a probability distribution 
of position, momentum and spin different from the distribution for non- 
interacting particles, whose probabilities are independent and therefore 
combine by simple multiplication. But the Coulomb interaction is in- 
corporated in the modifying factor, and the Fermi-Dirac interaction is 
incorporated in the initial probability or basis of statistics. The Coulomb 
force changes the wave functions, so that they satisfy a modified wave 
equation containing an extra term called the Coulomb energy. The Fermi- 



282 Physical Applications [15-1 

Dirac interaction gives zero probability to the symmetrical wave functions; 
these are therefore omitted from the beginning, and the initial probability 
distribution of momentum is limited to such parts of phase space as corre- 
spond to antisymmetrical wave functions. Even if the two interactions were 
of independent origin it would be desirable to express them in more com- 
parable form. But I think it is obvious that current theory, by treating the 
interaction of electrons in this piecemeal way, has arbitrarily divided into 
two compartments a subject which is really one. It cannot seriously be 
maintained that the Coulomb force, which prevents two slow moving 
electrons from approaching one another, is an altogether distinct pheno- 
menon from the exclusion principle (contained in Fermi-Dirac statistics) 
which achieves the same result by forbidding 'them to occupy the same 
phase cell. 

It is not as though Fermi-Dirac statistics were intended to be a first 
approximation, giving the probability distribution in the limit when the 
electrons are so far apart that their Coulomb forces are negligible. The 
difference between classical and Fermi-Dirac statistics is only important 
when the electrons are crowded together; and the conditions in which we 
apply Fermi-Dirac statistics are precisely those in which the Coulomb forces 
are large. The attitude of current theory is altogether bewildering. It sets 
up an ideal scheme of statistics only to repudiate it (by introducing a large 
modification) in the very circumstances for which it is designed. 

This separation of the interaction of electrons into two effects strongly 
resembles the separation of gravitation and inertia in Newtonian mechanics. 
The latter taught that a body tends to move uniformly in a straight line 
by its inertia, but is pulled into a different path by the gravitational field. 
Similarly today quantum physics teaches that electrons tend to take up the 
probability distribution corresponding to Fermi-Dirac statistics, but are 
forced into a different distribution by their electrical repulsions. There is 
need for the same kind of unification of treatment that has proved so success- 
ful in the unification of gravitation and inertia. 

It is well known that Fermi-Dirac statistics arise from the indistinguish- 
ability of the particles concerned. If we are right in believing that Coulomb 
force is another aspect of the same interaction, it must also arise from the 
indistinguishability of the particles. To test this we must investigate the 
precise way in which indistinguishability modifies the enumeration of 
probabilities; so that we can determine its effect on the wave tensors, and 
hence on the wave equations which the tensors satisfy. The investigation 
is carried out in this chapter. We shall find that the effect of the indistin- 
guishability is to introduce an additional term in the wave equation, which 
turns out to be identical with that which has been adopted empirically to 
represent the Coulomb energy. 



15-2] Electric Charge 283 

The leading idea in the investigation is that the equations must be in- 
variant for interchange of the indistinguishable particles. Consequently we 
have a new kind of relativity transformation or "rotation" of the system, 
bringing about interchange. The interchange can be performed continuously 
a gradual transfer of probability from one identification to the opposite 
identification. The argument of the new transformation is called the per- 
mutation coordinate, and its conjugate momentum is called the interchange 
energy. On calculating the interchange energy, we find that it agrees with 
the observational value of the Coulomb energy. 

The foregoing argument was the actual starting point of the theory of 
protons and electrons developed in this book.f It was, I believe, the first 
introduction of the permutation coordinate and its conjugate interchange 
energy in wave mechanics. Now that interchange energy is regularly used 
in practical problems, it is difficult to see why the author's theory of the 
Coulomb energy of electric charges is still looked upon as a dubious excres- 
cence on wave mechanics. In the equations in current use the identity of 
interchange energy and Coulomb energy is accepted. J I do not understand 
why an investigation whose results have come to be admitted without 
question in the formulae in regular use is still commonly alluded to as a 
' * bold speculation ' ' . 

15*2. Interchange . 

The initial difficulty in calculating the effect of interchangeability is that 
interchange seems to be a discontinuous transformation. But wave mechanics 
has been successful in replacing quantum "jumps" by continuous analysis, 
and the same methods are available for treating the jump of interchange. 

Denote the coordinates, including suffix coordinates, of two particles 
(not necessarily indistinguishable particles) collectively by x, x'\ and let 
Y (x, x') be a wave function of the combined system. Let the operation of 
interchanging x and x' be denoted by Q, so that 

*V(x',x) = QW(x i x'). (15-21) 

If T is single-valued, $ 2 = 1. But the ordinary relativistic wave function 
has ambiguous sign ( 9-6). For wlien the corresponding space vectors *FX* 
are rotated through 360, so that all observable characteristics of the system 

t "The Charge of an Electron 1 ', Proc. Ray. Soc. A, 122, 358 (1929). 

j Dirao, Quantum Mechanics, 2nd ed., p. 228, equation (38). The interchange energy is 
given as JF ff {1 + (a r , a,)}, whose eigenvalue is the Coulomb energy V TB . The unitary matrix 
factor depends on the circumstances of the problem to be treated, and does not affect the 
identification. 

When T aj | (x^ 9 a/) is written as T (a? M , a; #/, j3), we call a, suffix coordinates. Suffix 
coordinates can take only the four values 1, 2, 3, 4. They are sometimes called spin co- 
ordinates; but this name should be reserved for angular coordinates conjugate to spin mo- 
menta. 



284 Physical Applications [15-2 

are unchanged, Y and X rotate through 180 and become Y, X. If the 
operation of interchange is treated as a rotation, repetition of the operation 
will bring us to the other branch of the double-valued wave function, so that 
Q2 T= _ Y Hence Q 2 =-l. 
We introduce an "extended wave function" denned by 

T(*,*',x) = e4xT (*,*') (15-22) 

Then, since # 2 =-l, 

'H*. *', x ) = (cos | x + <2 sin 



In particular T (x, x', n) = T (x' t x); and 

V(x,x' )X + *) = V(x',x, x ). (15-24) 

Thus the particles are interchanged by increasing x by IT. Intermediate 
values of x have a simple interpretation. By the usual rule (15-23) represents 
a superposition of the state Y (#, x') with probability cos 2 \x an( ^ ^he state 
Y (a/, a) with probability sin 2 #. The interchange is therefore represented, 
not as a sudden jump, but as a gradual transfer of probability from the 
original to the interchanged state, as x increases from to IT. This is the 
recognised method of treating transitions between discrete states in wave 
mechanics. 

Suppose that we are describing the distribution of a red particle and a 
blue particle. We use probability distributions to express our inexact 
knowledge of their positions. It is appropriate to provide in the same way 
for inexact knowledge of their colour. To take another example two golf 
balls have been driven at a short hole; there is a certain probability that 
there will be one ball on the green and one in the bunker; there is also an 
(unequal) probability that the one on the green will be your ball or mine. 
Therefore, besides stating the probability that there are particles at two 
points x, x', we can state the probability^ that the particle at x is the red 
one, or the probability 1 p that it is the blue one. This is provided for in 
the extended wave function Y (#,#', x) which includes a permutation 
coordinate x such that cos^x^JP- I* 1 general we treat a state represented, 
not by one value of #, but by a probability distribution over the coordinate 
X so that we have a probability distribution of what is itself interpreted 
as a probability. 

The wave function *F (x, x', x) primarily applies to the general case in 
which the two particles are distinguished with more or less uncertainty. In 
practice we confine attention to two limiting cases, namely definitely dis- 
tinguished, and entirely undistinguished particles. If the particles are 
definitely distinguished, # = or TT, and we can arrange that x shall always 
be 0. If the particles are indistinguishable observationally, x is an un- 
observable. In other words the rotation g = e* G x is a relativity transforma- 



15-2] Electric Charge 285 

tion to a different but equivalent frame of reference. This is an altogether 
new type of relativity transformation, which appears for the first time in 
double systems. But analytically it is of the same form as the well-known 
relativity rotations, and it has the same consequences. When x * s un ~ 
observable, the hamiltonian cannot contain x explicitly, but it will contain 
the momentum (if any) conjugate to #. That is to say, x is an ignorable 
coordinate. 

There is nothing mystical about the effects of indistinguishability. We 
do not suppose that an electron knows that it will not be distinguished from 
other electrons, and on that account conducts itself differently. We can 
imagine a being more gifted than ourselves who identifies each individual 
electron. He applies the ordinary equations of distinguishable particles to 
them, and his results are right; but his solutions do not interest us, because 
we can never obtain the observational data which he uses, and have no 
opportunity to apply or test his deductions. He observes, let us say, a red 
particle and a blue particle; he finds that the red particle is at x l at time ^ , 
and at # 2 at time t 2 , and deduces that it had a velocity (# 2 - 1 )/(^ 2 - $1) and 
a momentum w (# 2 -- a^)/^-^). We observe a particle at x l at time * lf and 
a particle at x 2 at time t 2 , but we do not know whether it is the same particle. 
Velocities of particles (and the corresponding kinematical momenta) are 
not observational data for us; and a system of dynamics which manipulates 
such data is useless for our purposes. 

The dynamical equations therefore depend, not on whether the particles 
are intrinsically distinguishable or indistinguishable, but on whether and 
to what extent they are in fact distinguished. 

We must distinguish the general wave function ^'(x.x'.x) which may 
be any function of its arguments from the particular wave function 
*( X >*'>X) defined in (15-22). The latter represents a uniform probability 
distribution in #. If the states corresponding to different values of x have 
different probabilities p x , their combined wave function is p^^(x 9 x' 9 x)l 
or more generally we can form a combination with different probabilities 
of different states T (x, x') for each permutation angle. 

Returning to the problem of the red and blue particles, the operators 
id fix, -id fix' do not give the momenta of the red and blue particles 
which are the momenta referred to in the dynamical equations for dis- 
tinguished particles. We can, if we like, regard -id/dx as the momentum of 
a composite particle, which has certain probabilities of being the red or blue 
particle. We have not the data required for applying ordinary dynamics, 
which does not profess to treat particles composite in this sense. But we 
can construct a formally similar dynamics by treating x as an additional 
coordinate; so that the whole momentum of the system includes, besides 
the momenta of two composite particles, a momentum of interchange. 



286 Physical Applications [15-2 

We have then to consider distributions of probability over the domain of 
x, x', x, and more especially to determine distributions which represent 
steady states. When account is taken of the spin components of the stream 
vector, the distribution is represented in phase space; and the ordinary 
136 -dimensional phase space of a double system is extended to 137 dimen- 
sions by the addition of a dimension representing change of g. We shall see 
later that the 137 -dimensional phase space is a development of the "aug- 
mented phase space" used in 12-3; so that from this point of view the 
permutation coordinate appears as compensation for the dropping of the 
relative time coordinate 2 ~*i when two systems are combined into one. If 
we have observational information e.g. an observation of colour which 
at any time distinguishes the particles with fairly strong probability, this 
will be represented as a wave packet in #; just as observational information 
as to their position is represented by a wave packet in x, x'. The wave packet 
will gradually disperse not because the particle is liable to change colour, 
but because we cannot trace definitely which of the particles whose colour 
we observed is the one which is now at x. 

From the general problem we pass to the two special cases. Firstly, when 
the particles are entirely undistinguished, the equations are simplified by 
the fact that x i 8 an ignorable coordinate and can be eliminated, leaving 
only a term in the hamiltonian which represents its conjugate momentum. 
Secondly, when the particles are completely distinguished, the equations 
are again simplified because x ^ constrained to be 0. This is more than a 
limiting case; it is a change in our point of view. We could not in an ordinary 
way prevent the wave packet concentrated at x = from dispersing. The 
constraint involves a change in the basis of statistics. An astronomer may 
inadvertently interchange the two components of a double star which he is 
observing; but he treats this as a mistake. He does not expect the laws of 
celestial mechanics to predict his " observational result". The constraint 
X = is imposed by stigmatising any other value of x as a blunder. We have 
seen that the dynamics of distinguishable particles cannot from its very 
nature be applied to indistinguishable particles; but why should not the 
dynamics of indistinguishable particles apply (so far as it goes) to dis- 
tinguishable particles? The answer is that theoretical physics is intended 
to agree with the experience of an observer who does not make mistakes, and 
will take varying forms according to the definition of what constitutes a 
mistake. 

Analytically, the probability distribution of distinguishable particles is 
limited to the 136 -dimensional section # = of the 137 -dimensional phase 
space. A change of the basis of statistics is made by limiting the initial 
probability distribution to 136 instead of 137 dimensions. Neglect of this 
distinction between the dynamics of distinguishable and indistinguishable 



15-3] Electric Charge 287 



particles has caused an error of a factor ff in many of the currently 
accepted formulae of quantum theory. 

From the present point of view protons are indistinguishable from electrons. 
It might be thought that the greater mass of the proton would be a dis- 
tinction impossible to overlook. But the mass is determined by the operator 
id/dt. We cannot differentiate the probability distribution of a particular 
particle until we have settled how to identify that particle at different times. 
Thus mass can never be used as a criterion for distinguishing particles; it 
presupposes that they have already been distinguished. 

The Principle of the Blank Sheet requires that at the start we should 
recognise no intrinsic distribution between the particles which we con- 
template, in order that we may trace to their very source the origin of those 
distinctions which we recognise in practical observation. The fundamental 
dynamics is the dynamics of indistinguishable particles; the dynamics of 
distinguishable particles is a practical adaptation to be used when we do 
not wish to analyse the phenomena so deeply. 

15*3. The Fermi-Dirac Law. 

In current theory it is usual to take the wave function Y of two particles to 
be single- valued. Then, if Q is the interchange operator, Q 2 = 1. In place of 
(15*22) we take an extended wave function 



T (x, x', x ) = 
Then e* i7 * = cos fr + iQ sin \TT = iQ 

and T (x, x', x + w) = Y (x' 9 x, x) as before. 
Consider the tensor transformation 



(15-32) 

where P is the interchange operator of the frames E^ , F^ of the two particles. 
Let their coordinates in three dimensions be x^ , x^', the time t being common 
to these and other particles. Their combined position is specified by a 
position vector 

) (15-33) 



in the double frame. We use the strain vector form of X, since the time is to 
be treated as invariant. A wave function of the two particles will be denoted 
indifferently by T ( Xfl , */) or Y (X). 

When the transformation (15-32) is applied, we have to consider, as in 
8-3, not only the direct change of T but the change of its argument X. 
Usually the transformation will introduce new matrices into X, so that 
*P (X) ceases to represent a distribution in the original 3-space. But when 



288 Physical Applications [15-3 

X = TT, we have g= P; and the transformation of the strain vector X is 
(since P is space-like) 



by (10-33). Thus we return to the original 3-space, but x^ and x^' are inter- 
changed. The transformed value of Y is 



by (10-37). Hence the combined result is 



*F' referring to the transformed distribution and *F to the original distribu- 
tion. Using the permutation coordinate, we can also write (15-34) as 

T (x, x', 0) = *F (x, x', IT) = e^- 1 ) T (a, *', 0) (15-35) 

by (15-31). 

Thus, in a sense, Q 1 = P- 1; so that we have found a possible form 
for the hitherto unidentified symbol Q. The actual relation is 

e iix?-D ^ |- e tfc<-p-i)] f (15-36) 

where the right-hand side is construed not as a simple multiplier but as a 
tensor transformation applied to the function which follows. (It is therefore 
not permissible to cancel out e~* f x on each side.) 

Let us now treat ^ as a dynamical coordinate, so that the transformation 
q gives an extended wave function representing a probability distribution 
over the seven-dimensional domain (x, x', #). The six-dimensional sections 
= 0, X = TT coincide with ordinary space (repeated for the two particles). 
If the wave function is single valued in ordinary space, Y (x, x', 0) and 
^(#,#',77) must agree; so that by (15-35), " = T. Hence, by (15-34), 



Thus the wave function is symmetrical for interchange of the particles. 
An antiaymmetrical wave function is obtained by the transformation 

g = ei f x(-^i) (15-37) 

which leads similarly to 



In this case (15-35) is replaced by Y' (x, x', 0) = -*F(o;,a;',7r); so that the 
transformation (15-37) is not a simple rotation in #. Belativistic rotation, 
or parallel displacement, in x generates a special wave function, which we 
have called the extended wave function; this is symmetrical. But, as we 
have pointed out (p. 285), the general wave function may be any function 
of the coordinates x, x' 9 #, subject to the usual conditions of single-valued- 
ness; and the antisymmetrical wave function is one of the general functions. 
Since it is not generated by parallel displacement in #, its covariant de- 



15-3] Ekctric Charge 289 

rivative with respect to x does not vanish. Denoting the antisymmetrical 
wave function generated by (15-37) by T, and distinguishing transforma- 
tion operators from multiplying operators by square brackets, we have 



(x, x' 9 x ) = &*<-*+] T (x, x') 

(a:,a 
, x') 



a; / , x ). (15-38)t 

The covariant derivative of Y a with respect to x * 8 contained in the factor 
[e**], since the covariant derivative of Y (x, x', x), which denotes as before 
the extended wave function formed by parallel displacement of Y (x t x') t 
is zero. 

The following remarks will perhaps make the significance of the 
mathematical procedure clearer. Consider a series of six-dimensional 
states V(x, x', a), distinguished from one another by a parameter a. 
In the special case in which a is unobservable this mode of division into 
states becomes degenerate, and the series can only be contemplated as 
a whole, i.e. as a seven-dimensional state. If a is promoted to the rank 
of coordinate, Y (x, x', a) is one such seven-dimensional state; but we can 
form other states p (a) X (x, x', a) by combining six-dimensional distri- 
butions with different (algebraic) probability coefficients P(OL). In the 
seven-dimensional state the probability fluid ( 81) is no longer restricted 
to flow in the planes a = const., and there will in general be a probability 
flux along the coordinate a, determined by the covariant derivative 
operator i8/Sa and therefore depending on jp(oc). This flux was not 
allowed for in the original equation of continuity of the probability fluid 
in six dimensions; thus X will not satisfy the same differential wave 
equation as X F. For given a, T (x, x', a) is a self-contained six-dimensional 
state, but X(#, x' 9 a) is not. In our application a is the permutation 
coordinate x ' an d the primary object of the investigation is to find the 
term in the wave equation for X, arising from flux in the x direction 
and depending on p(x)> which does not appear in the wave equation 
for MP 1 . To obtain its value we must determine p(x)- For an arbitrary 
angular coordinate a, the only limitatfan on p (a) would be that it must 
be periodic in a with period 2?r; but exceptionally p (x) has also to fulfil 
conditions at the half-period. We can interchange the particles, either by 
changing x to X + ^ *^ e wave function p (x) X (x, x', x) of the seven- 
dimensional state, or by interchanging x and x' in a six-dimensional 
section, and the two results must agree. Our analysis of this condition 

t The analytical part of the investigation is interrupted at this point. There is a direct 
continuation of it, starting from (15*38), in 15*7. 



290 Physical Applications [15-3 

can be put in the following form. We first ensure by our definition of 
X (x 9 x' 9 x) that its covariant derivative with respect to x vanishes ; that 
is to say, a change of the argument x represents only the effect of the 
relativity rotation provided by the indistinguishability of the particles. 
The test at the half-period then shows that, if X (x, x', 0) is a symmetrical 
function of x, x', no additional factor p(%) is required; but if it is anti- 
symmetrical, we must insert p (x) = [e*x]. Although p (x) is here expressed 
as an operator, we shall find later that it reduces to an eigenvalue e*x /187 ; 
so that the result is not inconsistent with our original limitation of p (a) 
to algebraic values. 

Our inability to distinguish observationally the separate six-dimensional 
states permits the probability to flow freely between them a flux which, 
if they could be distinguished, would be represented in a different manner, 
viz. by transitions. We do not here attempt to predict to what extent 
advantage will be taken of this freedom ; but we discover that the existence 
of a cyclic flux in the new direction will betray itself by its effect on the 
wave function for any of the six-dimensional sections. In particular, we 
have calculated the cyclic flux which is indicated when the flow in a six- 
dimensional section is that represented by an antisymmetrical wave 
function. 

It should be added that we have here adopted the solutions which give 
the smallest covariant derivative. There exist also solutions p (x) = [e 2ni *] 
for symmetric wave functions, and p(x) [d 2n+ *** x ] for antisymmetric 
wave functions. The ultimate effect of taking w = is that we shall deter- 
mine the minimum charge of a particle. 

According to the principle of Fermi and Dirac, the wave function of two 
elementary particles is antisymmetrical. Their theoretical treatment only 
went so far as to show that it must be either symmetrical or antisymmetrical; 
the choice of an antisymmetrical function depends on empirical considera- 
tions. We cannot at the moment go further than they did. Accepting their 
conclusion, the wave function for a pair of elementary particles must be 
taken to be T. 

We must now consider the "relativity of identity ". We have stressed the 
fact that a simple wave function $ (x) yields nothing observable, since it 
contains no reference to any comparison object for measuring x. This 
objection is removed in the double wave function T (a?, #'), since one particle 
can serve as comparison object for the other. But the objection reappears 
in Y (x, x', x)> since there is no comparison object for x- Just as we require 
two particles to provide observable differences x^x^, we require four 
particles a quadruple wave function to provide observable differences 
X #'. An absolute permutation coordinate referred to an abstract frame 
has no observational meaning, even when the particles are distinguishable. 



15-4] Electric Charge 291 

We may observe certain distinctions, but the interpretation which we place 
on these distinctions is not a matter of observation. We have temporarily 
shelved this difficulty by identifying the particles by a characteristic 
(colour) recognised in some supernatural manner; but actually colour might 
be changed by Doppler effects, and it is no more an absolute criterion of 
identity than position.f All we can do is to lay down a scheme of equations 
to be conventionally adopted as a criterion that x * s constant, and then 
measure changes of x relative to this standard. 

Standing waves and progressive waves aflf ord an example of the relativity 
of identity. We have seen that the particle represented by a standing wave 
is of composite identity (13*1). That depends on the usual view that a 
progressive wave represents a particle with single identity throughout all 
time. But it is equally possible to define the particles represented by 
standing waves as the real individual particles; then the progressive waves 
represent composite particles. 

Accordingly a system of two particles must be referred to a comparison 
fluid described by double wave functions and involving a comparable 
permutation coordinate XQ The angles x &nd XQ being referred to an abstract 
"frame of identity" are unobservable; but the probability distribution of 
the relative angles x XQ > contained in the quadruple wave function of the 
double object system and double comparison fluid, may have observable 
characteristics. The partition of the relative permutation angle into 
" absolute " permutation angles referred to a frame is an analogous problem 
to that which led to the theory of the Biemann-Christoffel tensor. 

From this point our investigation bifurcates into two problems. Firstly 
(15- 4-1 5- 6), we calculate the interaction of two particles forming a hy- 
drogen atom. Secondly ( 16-7), we calculate the interaction between pairs 
of particles in any assemblage. J There is a considerable difference between 
the methods used; and the fact that both lead to the same value of the 
Coulomb energy is a useful check on the details of the calculation. 

15-4. Interaction in the Hydrogen Atom, 

We shall now determine the interaction between two undistinguished 
elementary particles which form a steady system. We know that, for a steady 
state to be possible, the particles must be of opposite sign. The system is 
therefore identified with a hydrogen atom. 

f Consider a double star with equal components, having a very large orbital velocity in 
the line of sight. We can distinguish the two components by observing that one is red 
and the other blue. Half a period later we can again distinguish the red star and the 
blue star. But the stars thus identified do not obey the accepted laws of celestial 
mechanics. The accepted laws are obtained by identifying the blue star at time t with 
the red star half a period earlier. 

J Atomic nuclei are not considered. 

19-2 



292 Physical Applications [15-4 

As in 12-6, we re-resolve the hydrogen atom into an external and an 
internal particle of masses 

.i/-^ fl -m , /t=ii 6 m . (15-41) 

The time direction is as usual taken to agree with the momentum vector of 
the external particle. Interchange of the proton and electron does not 
affect the external particle. We therefore confine attention to the internal 
particle of mass p. Its coordinates f l9 2 , 3 are the relative coordinates of 
the proton and electron, and are reversed in sign by interchanging them. 
The change is most simply described by using angular coordinates, so that 
the displacements and momenta are rdO^, id/rdO^. Radial displacement 
and momentum are treated in the same way by Letting 6 f = log r . The inter- 
change is then equivalent to reversing the sign of r, leaving the angular 
variables unchanged. 

It may be well to repeat that the interchange is purely subjective as 
when an astronomer inadvertently interchanges the two components of a 
double star, and so publishes a position angle wrong by 180. By such a 
mistake relative coordinates are given the wrong sign; only in treating 
indistinguishable particles we do not count it as a mistake, since there is no 
criterion for deciding which is the right sign. 

Continuous interchange is represented by taking r = r e**; or if necessary 
we may take r = r e tt x, where Q is any symbolic square root of 1. 

The states of the internal particle will consist of probability distributions 
over the space coordinates f l9 f 2 , 3 and the permutation coordinate x- 
There is no time coordinate in an internal state. 

It would not be illegitimate to treat states in which the distribution is over 
i> 2* fa> on ty> with x a constant for the state. It is purely a question of 
practical application. As explained in 15-2 the resulting system of 
dynamics would be true, but useless to an observer who (owing to inability 
to distinguish the particles) could not obtain the data required for its 
application. Every angular coordinate has to be treated as we here treat ^; 
if the conditions are such that it becomes unobservable, a degeneracy is 
introduced, and the states which would have been distinguished by different 
values of 6 are run together into a single state. The ultimate reason for this 
is that in physical applications we have to take account of transitions (due 
to external perturbations) between the states given by different values of 0; 
as 6 approaches degeneracy these transitions become more frequent and the 
calculation becomes unmanageable; for completely degenerate 6 an in- 
finitesimal perturbation is sufficient to cause transitions, and the states can 
only be treated as a combined whole. Conversely, the transition causes only 
an infinitesimal perturbation of external systems and is therefore un- 
observable; if we cannot detect a transition, we cannot distinguish the states. 



15-5] Electric Charge 293 

We have to picture, or represent symbolically, a rotation which changes 
r continuously into r. For a picture we require an extra-spatial dimension 
a through which r can turn. For symbolic treatment a matrix or symbol E a 
must be associated with a in the same way that E l9 E 2 , E 3 are associated with 
f i > f 2 > f s Owing to the absence of a time coordinate, a symbol J 4 is standing 
idle; this is available to represent a dimension perpendicular to E^E^E^, 
and we therefore set E a =E^. The question whether E is really the missing 
EI does not arise; it is sufficient that its commutation relations with all 
other symbols in the formulae are identical with those of JS? 4 .f If E r is the 
matrix associated with the direction of r when x = 0, a point in the four 
dimensions will be represented vectorially by E r r + E^a, and the x rotation 



The spread of the probability distribution over the additional coordinate 
a introduces a corresponding term E^ d/da in the differential wave equation. 
Or, using the standard form (8-631) for a strain vector, we have 

" - (15>42) 

The extra permutation dimension thus plays the same part in an internal 
state as the time dimension plays in an external state. 

Except that we see that the energy of an internal state arises from inter- 
change, being conjugate to the linear interchange coordinate or, our only 
result thus far is to obtain the form of wave equation, already familiar for 
external particles. The equation expresses the conservation of probability; 
but the coordinates, etc., contained in it are unobservable, being referred 
to the frame and not to the comparison fluid. We have next to derive from 
it an equation suitable for practical use. 

15-5. The Fine Structure Constant. 

It is necessary to consider the internal particle in conjunction with its com- 
parison fluid. Since m = 136/x, the observable angular displacements 0' are 
analysed into a displacement = \\\0' of the particle and a recoil = ^9' 
of the partial comparison fluid. The question now arises whether the same 
partition applies to displacements of the permutation coordinate #. We see 
immediately that the partition of x is governed by quite different consider- 
ations from the partition of 6 which is based on the theory of the Biemann- 
Christoffel tensor. 

In the first place there is no need to admit any recoil of x- The comparison 
fluid is an idealised substitute for actual reference objects; and we are free to 
choose either a "distinguishable" or an "indistinguishable" comparison 

f The use of rectangular relative coordinates implies that the region considered is small 
enough to be treated as flat, so that E 6 also is idle. But this absence of E 6 is a casual feature 
of a special problem; it is not comparable with the enforced absence of E it arising from the 
definition of an internal state as a simultaneous state. 



294 Physical Applications [16-5 

fluid, i.e. fluids whose constituent particles are respectively distinguishable 
or indistinguishable. Inasmuch as the fluid replaces distinguishable macro- 
scopic bodies, it is a less violent distortion of the natural conditions to use a 
distinguishable comparison fluid. Then #0 is constrained to be zero, and 
there is no recoil. 

There is no breach with our previous conventions and definitions in 
admitting this exception, though it will complicate subsequent applications. 
Initially we were free to specify the recoil in each coordinate arbitrarily; 
but later we introduced the condition that the (negative) recoil in the time 
direction is such that the object particle and the comparison fluid move 
forward together in time. Relativity conditions then determine the recoil 
in any other direction which can be connected with the time direction by a 
relativity rotation. But the direction x & defined by a tensor of different 
rank, and is not connected with the other directions by a relativity rotation, f 
It would be more of a breach with our previous conventions to admit an 
indistinguishable comparison fluid; for in all previous references to the 
comparison fluid we have treated it as composed of distinguishable particles. 
In particular its connection with the metrical tensor g^ v has been fixed on 
that basis. To substitute an indistinguishable comparison fluid at this stage 
would derange the metric, and upset our standard equations. 

Nevertheless it is instructive to consider what would be the result of 
employing an indistinguishable comparison fluid. In actual phenomena, 
shown by a comparison of the microscopic object particles with macroscopic 
reference objects, the comparison fluid is eliminated, and the final results are 
unaffected. The use of an indistinguishable comparison fluid as intermediary 
will simplify the microscopic part of the theory at the expense of complicat- 
ing the macroscopic part of the theory of these phenomena. The macro- 
scopic theory is affected because the energy tensor of the comparison fluid 
determines the metric to which the actual measures of length are supposed 
to be referred. We have seen that continuous interchange is represented by 
the transformation r->rex. This transformation may be attributed to an 
absolute change of r, or to an absolute change of the standard of length. 
(Since the other variables are angular, r is the only quantity affected by a 
change of the standard of length.) The standard of length for the object 
system is contained in the comparison fluid the idealised substitute for 
metre rods, etc. Hence the partition of x' into x an d Xo corresponds to a 
partition of the apparent change of r into absolute changes of r and of the 
standard gauge. That is to say, the "recoil" of the change of r is a gauge 
transformation. 

f We now propose to represent it as relativistically connected in a four-dimensional 
picture, assigning to it the matrix E^E r ; but the condition of admissibility of this repre- 
sentation is the point we are now considering. 



15-5] Electric Charge 295 



For a recoil Xoiirx'f the gauge is modified by the factor e *' 187 and 
becomes complex (if Q = i) or matricised. If we are to pursue this method we 
must abandon Riemannian space and adopt WeyPs geometry which admits 
complex gauge transformations, or the author's extension of it which admits 
more general transformations of gauge equivalent to matrix transformations. 
In wave mechanics we prefer to keep to Riemannian space; then x can have 
no recoil; and the equations of the microscopic theory are complicated by 
the fact that x behaves differently from the other angular coordinates in 
this respect. Before studying the phenomena which result, we notice two 
points of interest: 

(a) The equivalent complex gauge transformation corresponds to an 
electromagnetic field (8*8). Thus the phenomena will be of electro- 
magnetic character. 

(6) The gauge would have been changed by a factor eWis? ( or by a ma trix 
having this eigenvalue); thus a coefficient 137 is introduced. This coefficient 
is known empirically as the fine-structure constant. 

We have seen that formally a takes the place of an imaginary relative 
time f 4 . We shall try to elucidate this connection. Let (x l9 x 2 , x z , t), 
($1, x 2 ', # 3 ', *') be the coordinates of the proton and electron, and let 
(f i > f 2 9 3 > T ) be the differences x x l , etc. By the definition of a combined 
system r is constrained to be zero. We may, however, regard a displacement 
dr as a transformation to a frame of reference with a different reckoning of 
simultaneity. Consider the transformation 



The interval from the origin to the point considered is (r 2 r 2 )*=r ; and we 
can show easily that the intervals between all other pairs of points are like- 
wise independent of u. Thus a physical system occupying the domain 
(i> &> fa) * s intrinsically unaltered by the transformation. Apparently all 
distances in it are expanded in the ratio coshw; but this is accounted for by 
the fact that a new reckoning of simultaneity has been introduced, which 
antedates each particle by a time r = r tanhw, proportional to its distance 
from the origin. 

The corresponding transformation for imaginary relative time q is 



r = r ex, a = r 
For small displacements from the zero state (u = 0, x = 0) we have 



Thus far our formulae refer to changes of the system of reference real 
or imaginary changes of reckoning of simultaneity. But we can employ the 
transformation in the usual way to define a series of states of a system in the 
same frame of reference. The states will represent the same system ( 19 2 , ,) 



296 Physical Applications [15-5 

expanded in the ratios cosh u and e*x respectively, the latter expansion being 
interpreted as in 15-1, as a change of relative probability of the direct and 
interchanged states. The u series of states is unclosed (hyperbolic trans- 
formation) and a distribution of probability over it could not be represented 
in phase space. That is to say, in analysing the total distribution of pro- 
bability over the domain (f l , f 2 , 3 ) , into the sum of a number of elementary 
distributions, we cannot accept a dissection into a series of states con- 
tinuously varying in scale. The corresponding expansion of the wave func- 
tion in a series of elementary wave functions would not be convergent. For 
that reason, when once we have fixed our frame of reference, the u trans- 
formation is excluded, and r is identically zero. But in saying that an internal 
state is formed by the simultaneous configuration of two particles, we refer 
to simultaneity in real time; they may be non-simultaneous in imaginary 
time if we can find a meaning for such a phrase. Thus the % series of states 
is admissible (as a mode of dissection of the total probability distribution); 
and since it is a closed series (circular transformation) a distribution of 
probability over it can be represented in phase space. 

This clears up a point which personally I have found most difficult. It 
had seemed to me that for a real displacement dr, just as for a displacement 
dx> there would be no recoil of the comparison fluid.f An increase of r to 
r + dr might be attributed to an absolute change of r or to an absolute change of 
the standard of length, the latter constituting the "recoil" of the change of r. 
But since Riemannian geometry excludes changes of gauge, we can admit 
no such recoil. The argument is the same as that which led us to exclude 
recoil of x which would represent imaginary change of gauge. In short, 
Riemannian geometry requires a comparison fluid rotatable in every direc- 
tion, but not expansible or subject to interchange of its particles. 

To see the fallacy of this argument, we must recall that the same dis- 
placement dgp may be produced by a variety of rotations in different planes, 
corresponding to different transformations of the wave vector. Therefore 
when we treat a displacement dr or a momentum i djdr, we must not jump 
to the conclusion that it corresponds to a u transformation. By reference 
to the mode of formation of the wave equation, we see that the r-momen- 
tum which appears in it is the E r5 component of a space vector, and corre- 
sponds to rotation about the centre of curvature of space-time; this, of 
course, produces the usual recoil of the comparison fluid as provided for in 
the R.C. tensor. 

The nature of a displacement d^ at any point whether it is a rotation 
about the centre of space-time, or about an origin in the domain (^ , 2 , 3 ) , 
or an expansion of scale cannot be immediately seen by inspection. It 

t If the argument were correct, the Coulomb energy term found in 15*6 would be 
duplicated. 



15-6] Electric Charge 297 

depends on the system of analysis into elementary states furnished by the 
four commuting operators W, U l9 U 2 , Z7 3 . But without entering into the 
details of this analysis, we can show that there will be only one coordinate x 
which produces no recoil. The argument above was that whatever applies 
to the imaginary transformation of r should also apply to the real trans- 
formation of r ; but this is forestalled by the earlier result that of two anti- 
thetic transformations one will be circular and the other hyperbolic, and 
only the circular transformation is admitted in our system of analysis into 
elementary states. Thus we can only be concerned with one of the two 
transformations. 

We may recall that an E^ transformation displaces positive and negative 
charges in opposite directions in neutral space-time, and is therefore asso- 
ciated with polarisation rather than simple translation (6*3). It is this 
aspect of the r displacement as a change of separation of positive and 
negative charges which is handled in the internal wave equation. 

15*6. The Coulomb Energy. 

In the wave equation (15-42) the internal particle is represented by a simple 
wave vector $. The justification for separating \ft from the double wave vector 
of the particle and comparison fluid, and treating it as an independent 
distribution, depends on the theory of Chapter xn; but no account was then 
taken of the permutation coordinate. To rectify this we must go back to the 
quadruple wave function of the proton and electron and the corresponding 
unspecified particles which form their comparison fluid. This is resolved 
into two interchangeable double wave functions; or into a double wave 
function of an external particle (without permutation coordinate) and 
comparison fluid, and a double wave function of an internal particle (with 
permutation coordinate) and comparison fluid. Considering the latter, the 
permutation coordinate raises the number of dimensions of the phase space 
to 137. If it behaves symmetrically with the other dimensions (which implies 
that there is the same recoil in x as in the other coordinates) the same theory 
of resolution into simple wave vectors applies, except that the ratio of the 
mass [L of the internal particle to the mass w of the comparison fluid is 
now given by 13 7/ = m (15-61) 

instead of 136/x = m . 

Hence, if the comparison fluid recoils in #, the wave vector r satisfies the 
wave equation of the form (15*42) 

' =o - (15>62) 

But our standard comparison fluid does not recoil in x, and we wish to find 
the wave equation of the corresponding wave vector 0. 



298 Physical Applications [15-6 



In ( 1 5- 62) a displacement da = irdx is accompanied by a recoil d^o 
of the comparison fluid, in accordance with the relation w ^Xo = //^X- To 
eliminate this recoil we must transform the frame of reference so that the 
permutation coordinate of the comparison fluid in the new frame remains 
constant. The required rotation of the frame is in the forward direction of 
Xo, since the "recoil" is really an advance (owing to the time-like character 
of the matrix associated with it). The transformation due to the rotation of 
the frame is therefore 



for small values of a. (15-63) 

The transformation matrix is properly the matrix F r4c associated with the Xo 
rotation of the comparison fluid in its own frame F^\ but, since _F r4 commutes 
with all the symbols in (15-62), it is for our purposes an algebraic square 
root of 1. Substituting in (15-62), we have 



so that (putting a=0 after the differentiation) 

-' (16 ' 64) 



which is the required wave equation. 
Comparing with (9-64), we have 



Hence the value of the fine structure constant is determined as 

(15-65) 



15*7. Interaction in Systems of Particles. 

We turn now to the general problem of interaction in an assemblage of 
elementary particles. The total interchange energy of a particle will arise 
from interchange with every other particle. The difference from the previous 
investigation is that, since the particle has to be paired with more than one 
other particle, the special coordinates x^ , ^ are inappropriate; and we have 
to obtain expressions in terms of the coordinates x^ , x^ referred to an arbi- 
trary origin. 

Considering a wave function T of two particles, we resume the investiga- 
tion in 15-3 at equation (15-38), where it was broken off. In that equation 
the operator etow-D O r [e^x(-^-] gives the change of T due to parallel 
displacement in x- (Displacement in x is, for indistinguishable particles, a 
relativistic rotation of the frame of identification, and these operators give 



15-7] Electric Charge 299 

the nominal change of T due to its being referred to the rotated frame.) For 
antisymmetrical wave functions there is by (15-38) also a "real change" of 
Y represented by the operator [e*x]. The co variant derivative 8/8^ measures 
the real change; that is to say, 8/8# for Y is equal to dfix for [e*x]. But [ft*] is 
not a simple multiplier; it turns Y (X) into e'x Y (e 2i *X). We shall show that 
the change of argument eliminates ^|f ths of the factor e*x. 

When X-+e 2i *X, the coordinates become e 1 **^, e 2 '*^'. Since Y repre- 
sents a probability distribution over real coordinates, its interpretation for 
a complex argument requires definition. We are interested only in the 
co variant derivative of [e**]; therefore any part of it which represents 
parallel displacement of the function Y (z, x' 9 x) on which it operates may 
be ignored. A parallel displacement is produced by applying a tensor trans- 
formation q = e i( * to all the symbols concerned; this yields 



where x' is the transformed value of #, whose tensor character we leave for 
the moment undetermined. We therefore define Y for a complex argument 
oftheforme 2 *X:by 

ei*^(&i* x ,e,K*x',x') = ^(x>x'>X)> (15-71) 

since the two expressions are in any case equivalent for the purpose required. 
We want, however, to find Y(e 2 * a #,e 2l ' a a; / , ^); for a will ultimately be put 
equal to x> an d we naturally do not apply the transformation to the argu- 
ment which defines the transformation.! 

The transformation of the double strain vector of the distribution can be 
treated as a uniform gauge transformation JR-> Retf of the radius of the 
137-dimensional phase space. Denoting the strain vector YO* by S, the 
first effect of the transformation is to multiply S by e 137 *0 on account of the 
change of measure of the volume element attached to it; and the second 
effect is to multiply it by e~ 18W on account of the renormalisation necessary 
through the whole volume of phase space becoming e 187 #Q. These are 
equivalent to the two compensating changes in (15-71). It does not much 
matter which way round we identify them; but presumably the first is the 
result of the change of coordinates in the argument, and the second is the 
direct tensor transformation of Y and O by a factor e* a , which gives a factor 
e 2ia in 8. Thus 2a= - 1370. 

If we exempt the coordinate x fr m ^e transformation, so that the 
volume element contains dx instead of e*Pdx, *^ e ^ ra ^ factor becomes e 138 #. 
The exemption is a simple matter, because the interchange matrix P com- 
mutes with all the space-like matrices in the (?Z>-frame, i.e. the frame in 

t That would lead to the same confusion as the statement "every number on this page 
should be divided by 2", which (being amended in accordance with its own instruction) 
implies that the numbers should be divided by 1, and therefore that they should be divided 
by 2, etc. 



300 Physical Applications [15-7 

which the symmetrical and antisymmetrical wave functions are separated 
( 10-6). The second factor e~ 137l ' is unaffected. The complete cancelling only 
occurs when the transformation (parallel displacement) is applied to all the 
quantities concerned; in withdrawing one of them from the transformation, 
we do not have to introduce any compensating change elsewhere. The 
resultant change of S is therefore e i36tf. e -i37tf ^ta/ia? The change of T or 
O is e a/i3 7 . Accordingly, putting a = x, 

[e'*] = e*x/ 137 . (15-72) 

Hence the angular momentum is 

-iS/8x= 1/137. (15-73) 

To summarise the argument from the beginning: Indistinguishability of 
two particles causes a degeneracy, which is treated (like other cases of 
degeneracy) by introducing a dynamical coordinate #. The angular momen- 
tum i 8 fix is quantised by the condition that Y shall be single-valued. The 
change of T with x may be simple parallel displacement, in which case there 
is no angular momentum;! or there may be an additional real change. But 
unlike other cases of quantisation, the condition of single-valuedness is 
applied after a half-period when the original distribution has reappeared 
with the particles interchanged; and the Fermi-Dirac condition determines 
the real change to be that expressed by the transformation operator [e**]. 
If a transformation [e /a ] were applied to *(X, x), X being transformed in 
accordance with its tensor character as well as T and X, the result would be 
a parallel displacement which contributes nothing to the angular momen- 
tum. In the present case the transformation [e l 'x] is not applied to the co- 
ordinate x whose value it fixes. Consequently, instead of the factor efx being 
completely neutralised by the change of argument of Y, 1 out of the 137 
coordinates of phase space is left untransformed, and fails to do its share 
in neutralising e l X Thus a factor e** /137 survives. 

We have now to insert in the wave equation an extra term representing 
the linear momentum which corresponds to (15-73). We recall that the 
linear coordinate a corresponding to the x rotation is the imaginary relative 
time of the two particles. So that, if r is the distance between the two par- 
ticles, dardx- Let iK r be the matrix (in the double frame) associated with 
the direction of r.% It will be a y matrix, which by (10-35) commutes with P, 
and is therefore unaltered by interchanging the particles and measuring r 
in the reverse direction. The matrix of the x rotation, given by (15-32), is 
i(P+l). The matrix of the direction of a is the product K r (P+l). 

t Other than the quantum, which is added in our usual non-relativistic treatment, and 
dropped again when the angular momentum is converted into linear momentum. 

J We take iK, so that the direction may be associated with a symbol whose square is 1, 
as in a simple frame. It is understood that the strain vector representation is used throughout. 



15-7] Electric Charge 301 

The new momentum to be inserted in the wave equation is therefore 
(of. (8-37)) 



This is divided into two parts. The term K r j\yir embodies the reduction 
from the metric naturally associated with indistinguishable particles, whose 
initial probability distribution extends over 137 dimensions, to the standard 
metric associated with distinguishable reference objects. If there are two 
particles only, one of which is taken as the origin of polar coordinates, so 
that the hamiltonian contains a term (iK r )( id/dr), we can amalgamate 
jfir r (3/ar+l/137r)into# r 8/8r by setting Y = r~ 1 / 137 ''Y / , and taking Y' as the 
new wave function (cf. the treatment of (8-38)). But since we have already 
treated the case of two particles more directly, it is unnecessary to pursue 
this further. It is sufficient to notice that the term K r /I31r is the equivalent 
in the general formula of the change of mass from /A to /x' in the internal wave 
equation of the hydrogen atom, which will be considered further in the 
next section. The remaining term represents the Coulomb energy: 

K P 

Coulomb energy = -- . (15-75) 



The main additional result is that the matrix associated with the Coulomb 
energy is K r P. We naturally associate the Coulomb energy (or more strictly 
the Coulomb momentum) with the direction of r; we now see that it is 
necessary to multiply the matrix giving the direction of r by the symbol P 
in order to obtain the direction of the Coulomb momentum. We may there- 
fore describe the Coulomb momentum as equal to P/137r in the direction 
of r. In current perturbation theory it is taken to be P x /137r, where P l is 
given by (10-385), the other factor of P being omitted. How far this omission 
is justified will depend on the nature of the application. 

Light is thrown on the occurrence of P by the transformation investi- 
gated in 10' 8. On multiplying matrices in the UP-frame by P, we obtain 
matrices in the G^-frame. Thus the factor P disappears when we adopt the 
GtfMrame, i.e. when we divide the system of two particles into external and 
internal wave functions. Conversely we might have anticipated that the 
purely algebraic Coulomb energy, found in the internal wave equation of 
the hydrogen atom, would acquire a matrix P when referred to the frame 
of the coordinates # , x'. 

The previous investigation treated the interaction between particles of 
unlike sign; the present investigation applies most obviously to particles of 
like sign. We must suppose that E^ and F^ are both right-handed or both 
left-handed frames; otherwise there is no interchange operator P. But since 
no assumption has been made as to the nature of the stream vectors of the 
two particles, they may well have charges of opposite sign. It would, I think, 



302 Physical Applications [15-7 

require deeper analysis to show that the sign of the energy depends on whether 
the charges are like or unlike. In this section we have been content to use the 
Fermi-Dirac principle (partly theoretical and partly empirical) to obtain 
the operator \ffx\\ the principle is equally consistent with [>-**], so that the 
sign of the energy is left undetermined. It is possible, however, to go behind 
the Fermi-Dirac principle and, by investigating the quadruple wave func- 
tion, trace the Coulomb energy to the absence of recoil of the x coordinate 
in the comparison fluid, as we have done in the special case of a hydrogen 
atom. I think it would not be difficult to verify in this way the dependence 
of the energy on the sign of the charges. 

15-8. The Factor -JfJ. 

A new point brought out in this investigation is that the mass concerned in 
the internal wave equation is not p, but ft', where, by (15-61), 



In considering the consequences of this, we must remember that //, does not 
represent an energy actually present. The mass of a hydrogen atom on the 
verge of ionisation is M ; and although, for the purposes of analysis, an 
internal mass p. or p! is added, it is subtracted again at the end of the in- 
vestigation. When the atom is in a lower quantum state, ft is involved as a 
coefficient in the energy tensor ft-^/S^S^ of the internal wave function. 
When ft is reduced to ft' the energy tensor is increased, and the energy 
differences between the different states are increased. The actual energy 
differences are therefore ^f| times greater than those calculated in the usual 
way from Sommerfeld's formula (9-372), which ignores this factor. f 

It will be seen that the decrease of ft has just the opposite effect to that 
which we should at first have expected. The explanation is that ft is not under 
any circumstances a rest mass, but occurs only as a divisor in the expression 
(w^ + w 2 2 + ro 3 2 )//i for the kinetic energy. In particular it is not permissible 
merely to substitute ft' for ft in Sommerfeld's formula. 

The following alternative treatment leads to the same result. Consider a 
proton and electron confined within a rectangular boundary, so that there 
is no angular momentum, and the steady state can be analysed into standing 
waves in three perpendicular directions. We take the energy to be either 
positive or so slightly negative that the size of the atom is comparable with 
the dimensions of the enclosure. Equating the energies of the proton and 
electron to the energies of the external and internal particles, we have, by 



The Coulomb energy is omitted. 

t In (9-372) the symbol /* is used for a quantum number, and the internal mass 
v +mt) is denoted by m. 



15-8] Electric Charge 303 

Subtract from each term the energy m of a comparison particle, so as to 
leave only the energy due to specification. Then 



(15-822) 
where W ' = w +/i=ffw (15-83) 



by (15-41). 

Thus in order that the external and internal energies may be treated as 
additive in the same way as the energies of the proton and electron, the 
internal energy must be referred to a comparison mass m f which is |f times 
the comparison mass for External wave functions. The current treatment 
replaces (/*-f ci 2 /^- w ') by a linear hamiltonian, taking it for granted that 
the theory developed for external wave functions applies. We have seen 
that it is necessary to decrease /z to ^', which is equivalent to increasing the 
comparison standard from ra to ra ', so far as the internal state is concerned. 
But when we introduce the condition that the energy of specification of the 
atom shall be the sum of the energies of specification of its internal and 
external states, (15*822) shows that the right course is, not to diminish ju, 
to p in the current (Sommerfeld) formula, but to treat the formula as 
expressing the energy in a unit which is W '/w times the unit used for the 
external energy. 

Ideally we can measure experimentally the energy e /x of a particular 
quantum state of the hydrogen atom, and hence determine the mass /u by 
Sommerfeld's formula. The external mass M can be found experimentally 
by some procedure equivalent to counting the number of atoms in a quan- 
tity great enough to be weighed macroscopically. From M and /x, m p and 
m e can be calculated. But the ordinary determination of the masses by this 
method, which omits the factor ^ff , will be incorrect. The observed energies 
-/*, or the observed energy differences between different quantum levels, 
must be decreased in the ratio f|f before being compared with Sommer- 
feld's formula. 

Since M is very nearly the mass of the proton and p, is very nearly the mass 
of an electron, the practical effect of neglecting the factor is that the so- 
called observed mass of the electron is ff | times too great, that of the 
proton being correct. It was first pointed out by W. N. Bondf that the 
observed values of the various constants would come into line with the 
author's theory, if it could be assumed that the observational determinations 
of e/m e really determined Hf e / m e- At that time I was aware that a 
factor ||f would be involved, but had not been able to determine its precise 
incidence. 

t Nature, 133, 327 (1934). 



304 Physical Applications [15-8 

More precisely the uncorrected observed determinations of m p and m e 
should be the roots of 

10m 2 - 136wm + Hw 2 = 0. (15-84) 

Their ratio is 1834-1. 

We cannot assume that the factor ff will occur in, or have the same 
incidence in, all methods of determining m p /m e or e/m e . The theory of each 
method should be examined in detail in the light of what we have learned. 
In the next section, we shall express our result in a different form, which 
shows the range of its observational consequences more clearly. 

15-9. Revision of the Constants e, m^^m^h. 

In saying that 1847-6 is the correct value of the mass-ratio, and that the 
"observational value" 1834-1 is in error through neglect of a factor in the 
reductions, we are employing the definition of mass or energy which has 
been regarded as fundamental in wave mechanics, namely as the value of 
the operator ( - ih/27r) 3/3J, h being a universal constant. Naturally we do 
not guarantee that 1847-6 will be the correct value if mass is defined in some 
other legitimate, and perhaps preferable, way. It appears that the deter- 
mination of e\m e by the deflection method leads to the mass-ratio 1834-1. 
It cannot well be supposed that the factor j|f is concerned in this case. But 
the deflection method determines a mass which satisfies the classical 
definition. It is therefore not necessarily in conflict with our determination 
of the mass-ratio according to the quantum definition; though it has an 
important bearing on the larger question whether for general purposes the 
quantum definition of mass is the best to adopt. 

The discovery of the factor J|f creates a new situation; and there can, 
I think, be no doubt that the most satisfactory way of restoring order is to 
admit two constants A, h' to be used in connection with internal and external 
wave functions respectively. The remaining constants e, m e , m p have unique 
values. This is a radical change. Prom our first introduction to quantum 
theory we have been taught to regard E = hv as its most inviolable principle. 
But we did, in fact, tacitly abandon it, when we were forced to recognise 
that the momentum operator depends on the index of the wave function, 
and that the index of Dirac's Lorentz-invariant wave function is not the 
same as that of the commonly used wave function of the hydrogen atom. 
Moreover, if any definition has to be altered, h is the obvious victim ; because, 
unlike e, m p ,m e , it does not occur in classical theory. 

We take A' 



Then, if the double-valued internal wave function with operator ( t H/TT) 3/3a? 
is taken to be of index 1, the double-valued external wave function with 
operator ( - ih'lir) 3/3^ is of index j| . 



15-9] Electric Charge 305 

The two fine-structurfe constants axe 

a=:*c/27re a = 137, a'^A'c/ 2 ^ 2 ** 136. 

The fine-structure constant is merely a name for the number of dimensions 
of the double phase space concerned. 

As compared with our previous results, external masses will be reduced in 
the ratio jff, and internal masses will be unchanged. Counting ra as an 
external mass, our previous results become 



Hence m p , m e satisfy ( 15-84), and the mass-ratio is 1834- 1 . Bond's correction 
has been eliminated by the change in the definition of mass. It was originally 
required to reduce internal energies to the unit which was used for external 
energies; but we have now redefined external energies in a way equivalent to 
changing the unit. 

Let us consider the deflection method of determining mass. It would 
ideally be possible to determine the ratio m p jm e by comparing the deflections 
of a proton and electron projected with known energy in the same magnetic 
field. The motion will be in accordance with the wave equation 

(S# A -m)0 = 0, with p IA = (~ih'l27T)dldx (A + K lA , 
the wave functions being external. As h' will not appear in the ratio of the 
two deflections, the fact that it differs from h will not be revealed. The ordinary 
calculation will therefore give the correct mass-ratio, which on the new 
system is 1834-1. 

So far as radiation is concerned, the normal constant h is applicable; for, 
in absorption and emission, radiation is connected with the quantised, i.e. 
internal, states of material systems. 

Thus in general the unconnected observational determinations of the 
natural constants should be consistent with a = 137, m p /m e 1834-1. Either 
they depend only on classical theory, and do not introduce A; or they depend 
on internal wave functions or radiation frequencies. To introduce h' we 
require an experiment determining the absolute wave length of the external 
wave functions of electrons or protons. Presumably the diffraction of 
electrons by matter involves A'; arid the ordinary calculation will give the 
scale of the diffraction pattern ff-J times too large, unless the effect is 
concealed by a compensating factor. At present it is not possible to attain 
this accuracy. It is difficult to devise any other experiment in which the 
factor could manifest itself. 

Summarising our conclusions we find that the definition of energy in 
wave mechanics by the formula E=hvis not in all cases consistent with the 
established meaning of the term "energy" in classical theory. In this section 
we restore the classical reckoning with e, m p ,m e a& fundamental constants. 
We calculate the ratio m p /m e to be 1834-1. The ratio e/m p or e/m e must be 



306 Physical Applications [15-9 

found experimentally since it involves our arbitral^ standards of mass and 
length. It can be found from the Faraday constant, by the oil-drop method, 
or by deflection methods, none of which involve h. We must further consider 
how results that have been expressed in terms of h are to be reduced to 
determinations of e, m p9 m e . We first, by using a=Ac/27re 2 , express them in 
terms of a instead of h. Then a denotes the number of dimensions of the 
double phase space of the wave function concerned and its comparison fluid, 
and may be either 137 or 136. In the experiments which furnish the most 
accurate values of the constants the value is 137; but the theory of each 
experiment should be scrutinised with this point in view. 

This summary covers most experimental determinations of the con- 
stants; but there is one important experiment "which introduces new con- 
siderations. We shall treat it in the next section. 

15*95. The Crystal Grating. 

It is now considered by leading authorities that the various observational 
methods of determining e, m p9 m e , h cannot be completely reconciled by 
current theory. The discrepancy, as usually stated, is that the value of e 
found by the crystal grating method is definitely greater than that found by 
other methods. To put the blame on e is merely conventional; the dis- 
crepancy is in the relations of the constants, and can equally well be attri- 
buted to m p . The question naturally arises whether this discrepancy has 
anything to do with the factor jff. 

We must first exhibit the disagreement in a form free from irrelevancies. 
The crystal experiment consists in comparing the diffraction of the same 
X-rays by (a) a ruled diffraction grating, and (b) a natural diffraction grating 
formed by the lattice structure of a crystal. The comparison determines the 
lattice interval in terms of the known interval between the lines of the 
ruled grating. Thus we obtain the dimensions of the lattice in centimetres, 
and hence the number of lattice cells in unit volume, or in unit mass of the 
crystal. Since the lattice is formed by molecules, this gives the number of 
molecules in unit mass of the crystal. Using the known relative atomic 
weights, we deduce the mass M of a hydrogen atom in grams. 

This appears to be the only practical way of determining M without 
introducing e or h. But we can make another determination via e. A theo- 
retically simple way of finding e is to measure macroscopically the charge 
2ne provided by a large number of a particles; the number n of particles, 
whose charges are collected, is ascertained by actually counting them. 
Further, e/M can be found by an electrolytic method equivalent to accumu- 
lating the charges of a known mass of hydrogen until an amount large enough 
to measure macroscopically is obtained. By combining e and e/M, a value of 
M is obtained which is less than that found by the crystal grating method. 
The observed ratio agrees with f|f within the limits of observational error. 



15.95] Electric Charge 307 

There appears to be gnly one vulnerable link in this chain of connection, 
namely the assumption that in the crystal there is one molecule to a lattice 
cell. That depends on a classical picture of the particles in the crystal. In 
wave mechanics the structure of the crystal is represented by standing 
waves. It is not necessary to suppose that the "particles of composite 
identity", which are more or less located at the lattice points, represent an 
equal number of pure particles. In more elementary problems the invariance 
of the density of eigenfunctions secures that different modes of analysis into 
particles yield the same total number of particles; but this does not apply 
when the number of dimensions of the phase space is different in the two 
analyses. A crystal is a typical example of a system composed of indis- 
tinguishable elements. Any pair of elements will have a permutation co- 
ordinate and an interchange energy conjugate to it. It is possible, and even 
probable, that in constituting the standing waves of a system of this 
character, 137 particles of pure identity furnish 136 composite particles. 

The problem lies rather beyond the point which our theory has reached. 
But if the observations are trustworthy, it would seem to be a direct deduc- 
tion from them that there is a discrepancy in the two ways of counting 
particles in a macroscopic aggregation. In counting a particles, no question 
of interchange arises, because they are observed one at a time. Ideally we 
might vaporise the crystal, and count its molecules in the same way; but 
instead, we try to count the molecules as they lie simultaneously in space. 
It is scarcely surprising that we should become confused in our count of 
entities which have no definite position and are indistinguishable from one 
another. We count something; but not the particles which we should count 
if they passed successively before us. In the crystal we identify a molecule 
by the lattice cell which it occupies, disregarding the fact that the molecules 
counted by the other method could not be localised individually. This 
procedure introduces a permutation coordinate. The extra coordinate 
increases the energy invariant of a particle in the ratio yf |, in accordance 
with the law of equipartition of energy. We have thus to assign to each crystal 
molecule a mass ^ff times that of a corresponding gas molecule as the 
observations actually indicate. This is not energy of crystallisation (which 
is too small to be considered); it is presumably a sign that, in the mathe- 
matical transformation of the external wave functions of gas molecules 
into internal wave functions of a crystal, 137 gas particles become re- 
partitioned in 136 crystal particles. We can thus suggest a possible origin 
of the discrepancy of the crystal results ; but it would require a much fuller 
investigation to determine whether the factor jf^ is an observable correction, 
or whether it is compensated in the complete theory of the experiment.! 

f I understand that (in view of still more recent experiments) the present opinion is 
that the supposed crystal grating discrepancy is spurious. 



CHAPTER XVI 
THE EXCLUSION PRINCIPLE 

16-1. The Second Quantisation. 

Let J r (r=l, 2, ... n) be n commuting idempotent symbols. Let U be a 
common eigensymbol of the J r , and let e r be the corresponding eigenvalue 
ofJ r . Since J r a -J r = 0, e r = or 1. 

By 3-7 (e) an eigensymbol for any given set of eigenvalues e r can be 

found explicitly, namely 

(16-11) 



where V is any symbol. Since J 8 2 J 8 : =0 9 and e 6 a e 8 = 0, 

(Js-es)(Js-l + **) = <>> (16-12) 

from which it follows at once that ( J 8 e 8 ) U = for all values of s. 

Let g=n(J r -l + e r ). (16-131) 

i 

We shall regard q as a transformation operator. It transforms any expres- 
sion V 89 which is not in general an eigensymbol of the J r , into 

U 8 = qV 89 (16-132) 

which is an eigensymbol giving the set of eigenvalues defined by q. By 
(16-12), q is singular; there is therefore no inverse transformation. Also 

b y ( 16 ' 12 ) qJ. = qe t . (16-133) 

A case of special importance is 

V^J 8 V 89 (16-141) 

which gives on transformation 



Ze 8 U 8 = X e U 8 , (16-142) 

i i 

where 2 e denotes summation over those suffixes for which e,= 1. 

Similarly, if F = SJ r J 8 K %8 , (16-143) 

we have U = Xe r e 8 U, t8 = X e U rt8 , (16-144) 

the summation S e being restricted to pairs of suffixes both of which corre- 
spond to eigenvalue 1. 

To apply this in physics, we take J r to be an existence operator for an entity 
or condition c r . IfJ r has the eigenvalue 1, r is said to exist in the system 
described by the eigensymbol V\ if the eigenvalue is 0, e r does not exist in U. 
In the system described by V which is not an eigensymbol of J f , the entity 



16-1] The Exclusion Principle 309 

e r is not definitely pre^nt or absent, and e r is said to have partial existence 
in V. Two kinds of partial existence are commonly recognised; r may have 
a probability of existing in F; or, without being wholly in F, it may have a 
component in F. In the former case, what may be called the "degree of 
existence" is expressed by a numerical relation to complete existence; in 
the latter it is expressed by a directional relation. Both are combined in a 
vector which possesses magnitude and direction; and, as we shall see later, 
the general existence symbol J g , which replaces the classical existence 
symbols (1 for existence, and for non-existence), can be represented as a 
strain vector. 

To understand the importance of this investigation, we must recall that 
in the primary developmqpt of quantum theory a system is looked upon as 
the product of its parts rather than as the sum of its parts. We have treated 
double systems in this way in Chapter xn, and quadruple systems (a proton 
and electron with their respective comparison particles) in Chapter xv. 
But for large assemblages the product method becomes unsuitable. If the 
probability of one member of the assemblage vanishes, the probability of 
the whole assemblage vanishes. That takes us far from the macroscopic 
outlook in which the existence of one particle more or less scarcely matters. 
Now it is important to develop a microscopic theory which shall converge 
to the macroscopic theory when the number of particles becomes very great. 
The basis of this new development must be the assignment of additive 
instead of multiplicative characteristics to the parts into which the system 
is analysed. 

To a certain extent we have anticipated this new development, especially 
in Chapters xm and xiv. But we there freely employed the exclusion prin- 
ciple, and we now return to the beginning of the theory to discover its 
foundation. 

In (16-142) we have practically the classical conception of an additive 
characteristic. The quantity U is made up of contributions U 8 from the 
various particles fi , the summation being of course limited to the particles 
which exist in the assemblage to which U refers. The assemblage is,however, 
viewed from the standpoint of selection rather than creation] so that S C C^ 
appears as part of a universal sum ZU 8 . This is now treated as a particular 
case of an assemblage of partially existing particles with characteristics V 8 , 
which are additive in a generalised sense F = J s V 8 , the symbolic coefficient 
J 8 being a measure of the degree to which the sth particle is present in the 
assemblage in the sense explained above. This generalised addition is of the 
same type as that used in expressing a vector as the sum of its components, 



In calling the entities f "particles'* we are redefining the term partide, 
so as to correspond to the conception of a system as the sum of its particles 



310 Physical Applications [16-1 

rather than as the product of its particles. As wet have pointed out, the 
particles in macroscopic systems in equilibrium have composite identity as 
compared with the pure elementary particles originally introduced. 

We shall refer to the present development as secondary quantum theory, 
the multiplicative combination of particles being primary quantum theory. 

16-2, Jordan - Wigner Wave Functions . 

We shall now express V = S J 8 V 8 in a form, due to Jordan and Wigner, which 
is well known in quantum theory. This transformation is not of any particular 
importance from our point of view, %J 8 V 8 being not only the most direct but 
also the most convenient expression of the additive combination rule; but 
it is desirable to show how it is connected with tjie customary outlook. 

In the application contemplated, the suffixes s refer to the pure elementary 
states that can be occupied by particles, and V 8 is the stream vector of the 
sth state. In general a state is only partially occupied; equivalently we can 
say that a particle whose stream vector is V 8 is partially present in the 
system considered. 

Since the states are pure, V 8 is the product of two wave vectors. The 
Jordan-Wigner theory is limited to the case in which the stream vector of 
each state is a perfect square. We can therefore take V 8 to be a strain vector 
of index 2 equal to if> 8 . 8 , or a strain vector of index equal to $ 8 . t/t 8 , where 
$ s is the complex conjugate of iff ( 8*6). It is understood that these are outer 
products; it would be inconvenient in the present investigation to indicate 
this explicitly by the asterisk notation. 

The characteristic equation J 8 2 7 8 = of an idempotent operator is of 
the second degree, so that it can be represented by a twofold matrix. We 
shall therefore represent J 8 by the idempotent twofold matrix 



where & is a Pauli matrix, i.e. any degenerate twofold matrix whose 
square is 1. In treating E J 8t \V 8 , we wish to resolve J 8t \ into two factors to 
be associated respectively with the two factors of V 8 . It is possible to express 
J 8t ^ as the product of two 2-vectors; but in the Jordan-Wigner theory a 
different kind of factorisation is employed. Let ^, fl , be three perpen- 
dicular Pauli matrices. Then by (3-86) 



(16-212) 

We can therefore write 

J*X = *A> 1-4,A=A, (16-22) 

where a* 88 &(/* + *) = * (/*-**) (16-23) 

and i 8 is a square root of 1, commuting with the 's. 



16-2] The Exclusion Principle 311 

Since the existence operators J r , J 8 commute, the Pauli matrices used for 
their representation commute; that is to say, we must employ for the 
different particles Pauli matrices ^ , J^, vr and A , 8 , ^, v > 8 , belonging to 
different commuting symbolic frames. 

The Jordan-Wigner wave functions are 



. (16-24) 

Forming their outerf products TT and TT, we obtain, if we omit the 



, 

=A,J 



V ' 



by (16-22). We call A the complementary stream vector to V. It represents the 
stream vector of the states to the extent to which they are unoccupied, in 
the same way that V is the stream vector of the states to the extent to which 
they are occupied. 

The omission of the cross-terms is to some extent justified when the 
elementary wave functions $ 8 form an orthogonal set, so that their products 
vanish on integration over the whole domain. It is in this integrated sense 
that *PY and V are equivalent. Physically this correspondence is sufficient, 
because we have no right to treat a wave function except as a whole. But, if 
we consider the universe of completely occupied states, given by S V 8 = V -f A, 
we come back to the classical conception of addition ofV 8 . In order that our 
formulae may converge to those of classical theory, it is necessary that 
TT + YT should converge to F-f A locally as well as on integration; that 
is to say, the cross-terms in TT + TT must vanish. 

The point is that when a state extending over a region of space is partially 
filled, the location of the entity which is (partially) occupying it is a matter 
of probability; and we have the complication usual in statistical theory that 
the expectation value of a product is not the same as the product of the 
expectation values. But when the state is fully occupied, it represents a 
definite distribution which can be treated as a classical fluid having a deter- 
minate density at every point. 

The rs cross-term in W + TT is 



Since the 0's commute, this will vanish if 

a r a 8 + a 8 a r =0, a 8 a r +a r a 8 =Q. (16-261) 

Since ^ r and ^ commute, (16-261) will be satisfied if 

V8 + Vr=<>. (16-262) 

f Outer products so far as 0, and $ t are 'concerned. The symbolic coefficients follow 
their own commutation rules found below; ift a and ift a commute. 



312 Physical Applications [16-2 

That is to say, the factors i s in the different termr must be taken to be 
anticommuting square roots of 1. Hence we have also 

a r a, + a 8 a r =0, a r a^+a a a r = (r^s). (16-271) 

Also by (16-22) a 8 a 8 + a a a s = 1. Combining this with (16-261), we obtain 

a r a 8 + a a a r = S r *. (16-272) 

The equations (16-271) and (16-272) satisfied by the coefficients of the wave 
functions (16-24) are known as the Jordan-Wigner commutation rules. 

16-3. Einstein -Bose Particles. 

In the foregoing analysis we have supposed that the states or particles are 
fully distinguished by the additive characteristic^ which is being studied. 
We shall now consider states with a distinguishing characteristic W 8t , such 
that the states W 8l , W 82 , . . . W^ all have the same V 8 . Let J 8t be the existence 
operator for the particle occupying the state HJ,; then (16- 141) is replaced by 

F = S M J 8 ^==SA^, (16-311) 

where # 8 =J sl + J, 2 + ...+ J 8W . (16-312) 

Since the J ri commute and have eigenvalues 0, 1, the eigenvalues of K s 
will be 0, 1, 2, ... m. Its minimum equation is therefore 

A'(A-l)(A:-2)...(A r -w) = 0. (16-32) 

Let A:' = #-im. 

Then, if m is even, (16-32) gives 



ii / 

K' n 

-*m\ 



l + ^Lj = 0, (16-33) 



where r takes successive integral values from - \m to \m. As w-oo, this 
tends to the limit 8inw /r = 0. (16-34) 

Similarly, if m is odd, the limit is cos nK ' = 0. 

Particles, of which there may be any number in the same recognised state 
V 8 , are called Einstein-Bose particles, as distinguished from excluding or 
Fermi-Dirac particles. The distinction is practical rather than fundamental; 
the Einstein-Bose case arises when the recognised states V 8 comprise a great 
number of ultimate elementary states W 8f9 i.e. when the practical classi- 
fication of states is less minute than the theoretical classification. We have 
then to associate a cardinal operator K 8 instead of an existence operator J 8 
with the recognised state, and its eigenvalue gives the number of particles 
in that state. 

The use of K ' instead of K depends on the theory of the self-consistent 
field. We cannot make a large change in the number of particles in a state 
without upsetting the conditions which governed the original analysis into 



The Exclusion Principle 313 

states. Strictly speaking we must know what states are occupied in order 
to determine the controlling field, and a knowledge of the controlling field 
must precede the analysis into states. In practice this means that we 
employ an approximate or average distribution of the particles to calculate 
the self-consistent field. Then we must provide symmetrically for the 
addition of extra particles, and the subtraction of particles allowed for but 
not present. The characteristic to be considered is therefore V'=*ZK 8 V 8 , 
whe^e 2K' is an operator which takes the eigenvalues 1, 2, 3, ... according to 
the number of additional particles definitely present, and 1, 2, 3, ... 
according to the defect in the number of particles. This is fulfilled by 
operators satisfying the characteristic equation shirr/?' = 0. 

The substitution of K a ' for K 8 is equivalent to the substitution J ri \ for 
J 8t in (16-311). We may regard half of the energy (or other mechanical 
characteristic) of the particle as allotted to the self-consistent field and 
thereby contributing to the energies of the particles in other states. Then if 
the particle is fully present we have to add its half of the mutual energy; if 
it is absent we have to subtract the energy wrongly included in the self- 
consistent field. This is provided for by the eigenvalues , J of J J. 

Another point of view is that (/ ) and K' are relative existence 
operators, as distinguished from the absolute existence operators </, K. 
It would be absurd to schedule everything which does not exist. To call 
attention to the non-existence of an entity implies that we had some 
a priori expectation that it would exist. To regularise this outlook we 
contemplate a comparison system consisting of entities e r which have an 
equal probability of existing or not, so that their respective existence 
operators have expectation values J . The object-system is then specified 
by relative operators J r %, which express deviation from the comparison 
distribution. This procedure in secondary quantum theory corresponds 
to that which we have already introduced in primary quantum theory in 
11-6. In connecting the results of primary and secondary theory it is to 
be noticed that the ideal comparison fluid of secondary theory consists of 
a set of half-occupied states, whereas the comparison fluid of primary 
theory is the impermeable background of fully-occupied states formed by 
the unspecified particles of the universe. 

In the usual applications the number m is treated as infinite, so that 
(16-34) applies. A solution is 

K 8 '=-id/d0 89 (16-35) 

where 9 8 is a periodic argument of all operands contemplated. For then 



Thus &>*&*= e- iTrK '\ so that sin7r^' = 0. We thus obtain a representation 

of v ' ** 



314 Physical Applications [16-3 

It is appropriate that a new coordinate 8 should Ve introduced, since the 
existence of states W^ not discriminated by the Values of V 8 implies that there 
exist other characteristics, e.g. momenta 3/30,, unrecognised. A connection 
with classical theory is obtained by setting 



where t is a common argument for all the functions V 8 . Then v 8 is the fre- 
quency of V 8 \ but V 8 is not necessarily a simple harmonic function. By 

(16-361) i 137 

P_ * jj*a. (16.362) 

27T V 8 Ot 



In particular, if V 8 is a momentum vector id/dx^ , the resultant momentum 
vector is 

r 



which may be compared with the momentum T 4 given by the energy 
operator (13-12). 

In current quantum theory Einstein-Bose particles are represented by 
wave functions analogous to the Jordan-Wigner wave functions of Fermi- 
Dirac particles. It is assumed as before that the stream vector V 8 is a perfect 
square. V is written in the modified form 



Since K 8 now operates on one factor only, we must take K 8 = - 2id/d6 8 . 
A possible factorisation of K 8 is 

g t = e** f a 8 = dlda 89 (16-371) 

since a 8 a 8 = 3/3 (log a 8 ) = - 2i d/d0 8 . Thus, if cross-terms are omitted, 

F' = W = (Sfe)(Za^). (16-372) 

Further, if/ is any symbol, 



so that a 8 a 8 a 8 a 8 =l. And since all other combinations of a 8 , a 8 commute, 
the complete commutation rules are 

(16-381) 
0. (1,6-382) 

These may be compared with the Jordan-Wigner commutation rules in 
(16-271), (16-272). 

In problems concerning a specified system, the unspecified particles will 
in general be Einstein-Bose particles. For it is clear that the type of analysis 
into states V 8 applied to the specified system does not provide anything like 
enough states to accommodate the 10 79 unspecified particles separately. 



16-4] The Exclusion Principle 315 

For example, consideibthe box-problem ( 13*2) applied to material of the 
density of water. If (to avoid introducing supernatural barriers) we suppose 
the uniform density to continue indefinitely, we obtain a space whose radius 
is of the order 3 . 10 8 km., and a mass about 10~~ u of the mass of the universe. 
In this case the recognised states provide room for only 1 in 10 14 particles; 
so that in (16-312) m=10 14 . The unspecified particles provide the imper- 
meable background with occasional vacancies which we have identified 
with photons. Any number of vacancies up to 10 14 may occur in one recog- 
nisedenergy state; and the photons accordingly obey Einstein-Bose statistics. 
It will be seen that a change of outlook can have the effect of changing 
Fermi-Dirac particles into Einstein-Bose particles. The theory of Chapter 
xivis based on a system of excluding particles (system A); but these become 
transformed into Einstein-Bose particles in the usual outlook (system B). 

16*4. Relation to the Energy Tensor. 

It is important to realise the considerable change of outlook that is involved 
in passing from the primary to the secondary quantum theory. The analysis 
which shows that in given field conditions there will be a series of steady 
states, composed of probability distributions with wave functions tf/ a or 
stream vector functions V 8 , is common to both. In primary quantum theory 
a particle may have its total probability (unity) distributed between different 
states, so that its stream vector is V = Zp 8 V 8 . If a second particle is present 
with the stream vector V = p a ' V 8 , there is no warrant for forming the sum 
(p^ +PS) V a . The only combination of V and V contemplated is the product 
FF', which indicates a set of double states V r V 8 ' occupied with probabilities 
PrPa- If *b e addition p 8 +p a ' were admitted, the total probability of a state 
might exceed unity, and the exclusion principle would be violated; this 
contingency does not arise, since the conception of such an addition is 
foreign to the outlook of primary quantum theory. In secondary quantum 
theory a different type of particle is introduced (composite from the point of 
view of primary theory), each particle being limited to one state. The 
numerical probability factor p a is now replaced by a vector probability 
factor </ 8 . The exclusion principle is represented by the fact that, the eigen- 
values of J 8 being and 1, its expectation value will always lie between these 
limits. 

The general theory of relativity throws light on the meaning of V = J 8 V 8 . 
Macroscopically the additive characteristic of a system is its energy tensor. 
Thus the secondary quantum theory rightly rejects the sum of vectors 
f or Zp 8 V 8 , and employs the sum of tensors of the second rank %J 8 V 8 . 
Moreover, owing to the non-linearity of the equations, the self-energy tensors 
of particles are not strictly additive in general relativity theory. The additive 
energy is a mutual energy of the particle and the rest of the universe. The 



316 Physical Applications [16-4 

energy is therefore not the square of the momentum vector V 8 V 8 , but the 
product of a special factor V 8 and a world factor J 8 . We have already seen 
how we endeavour to reconcile this with our usual non-relativistic outlook 
by artificially breaking up the mutual energy into self energies of the 
particle and the rest of the universe (treated as comparison fluid). The 
convergence of the energy tensor %J 8 V 8 of secondary quantum theory to 
the energy tensor T^ v of general relativity theory, as we pass from the 
microscopic to the macroscopic outlook, is exhibited in (16-363). Since 
the macroscopic outlook implies a much less minute discrimination of 
individual states, the quantum expression for the energy tensor is first 
reduced to Einstein-Bose form S K 8 V 8 . 

t 
16*5. Enumeration of Wave Functions . 

By using the observed values of e/m and /c, we have found that there are 
approximately 136.2 256 protons and an equal number of electrons in the 
universe. This will now be confirmed by an independent investigation which 
shows that the number is exact. 

Our ability to predict the number of protons and electrons in the universe 
implies that the number is imposed by the procedure followed in analysing 
the interrelatediiess of our experience into a manifestation of an assemblage 
of particles or wave systems. It is a commonplace that electrons are not 
intrinsically distinguishable from one another; it is therefore not surprising 
that the total number, allowed for in our scheme of dissection of phenomena, 
depends on the conventional distinctions introduced when, for example, we 
decide that a certain diffuse wave packet is composed of two electrons 
rather than one. 

We first notice that the value we have found for the total number of 
particles N belongs to a series of numbers 

^=2.3.24, #2 = 2. 10. 2 16 , JV r 4 =2.136.2 256 , 

i.e. n (n + 1 ) 2* 2 , with n = 2, 4, 1 6, .... If ^/i is regarded as characteristic of a 
simple wave function, N^ and N^ will be the corresponding characteristics 
of double and quadruple wave functions. We shall first investigate a "uni- 
verse" described by simple wave functions and show that it consists of N 
(i.e. 96) particles. It will then be comparatively easy to show that the actual 
universe contains N 4 particles. 

Conditions we can scarcely call them phenomena which require for 
their representation no more than a simple matrix frame have been treated 
in Chapters i to vin. We can represent a position vector or a stream vector, 
but not both; if the position vector is represented in an ^-frame, we have to 
change to a (/-frame to represent the stream vector ( 10-5). The familiar 
particle, whose position vector and stream vector are both inexact but not 



16-5] The Exclusion Principle 317 

entirely uncertain, cannot be represented on this plan. But in Chapter vm 
we made a new departure by introducing D-operators by means of which 
we were able to define vector functions instead of isolated vectors; and in 
particular the momentum vector was defined. In fundamental investigations 
of the kind which we are now treating, the stream vector and the momentum 
vector coalescejf but their coalescence is, strictly speaking, 



For the operator id/dx^ applies to a representation of positions, and it is 
necessary to change to another frame to obtain a representation of the 
stream vector j^ . 

Consequently, by using position and momentum instead of position and 
stream, we are able to keep the whole representation of the particle within 
one matrix frame at the cost of using differential operators as well as the 
matrix operators of the frame. 

This is an illustration of the use of differential operators to avoid multi- 
plicity of frames. It can be greatly extended; and, as is well known, particles 
are generally distinguished by allotting to each a different orthogonal wave 
function, rather than by allotting a different matrix frame as contemplated 
in 2-9. It is true that we must still assign to them existence operators J 8 
which are represented by Pauli matrices in different frames; but that, as it 
were, segregates the question of multiplicity of frame from the main part of 
the analysis, and eliminates it altogether in the case of particles having 
complete existence in the system (J 8 = 1). 

In the systems containing many particles which we have hitherto treated 

the box problem in Chapter xin and the Einstein universe in Chapter xiv 

we were concerned only with fully occupied states, so that existence 
operators (other than unity) with their attendant multiplicity of frames did 
not appear. If the scalar state is not fully occupied, we must in general 
express the extent of its occupation by an idempotent operator J. In (13-61) 
we divided the scalar state into two substates J and 1 J, with equal and 
opposite spin. The purpose of dividing an entity into two parts is that we 
may conceive of one part existing without the other; so that the division is 
made by distinguishing a part J 'which is occupied (since J. J= 1 . 7) from 
a part 1 J which is unoccupied (since J ( 1 J) = 0). The further separation 
of the state into substates with opposite charges depends on a double 
existence operator J r J s ' for two entities, since charge is meaningless unless 
there are two charges to interact. The four substates (13-62) are represented 



t We regard non-algebraic wave functions, whose momentum vector has matrix com- 
ponents and is not to be identified with j^, as a side-development of the wave functions 
primarily studied. As explained in 9*5, the steady states defined by them are only con- 
ditionally steady. 



318 Physical Applications 16-5 

The system represented by a simple wave functjpn has four dynamical 
coordinates, and in field-free conditions we may take the steady state to 
be (cf. (8-53)) ^^ ^ ^ ^ e ^*M**i^*^*4fa 9 (16-52) 

where ^ is a common eigensymbol of the four commuting ^-symbols. The 
dynamical angular momenta i3/3a l9 etc., have eigenvalues . Each set 
of four values defines a corresponding eigenfunction 0, and the 16 com- 
binations of sign therefore define 16 eigenstates. 

In our earlier treatment we regarded a particle as distributed with pro- 
bability factors p r between the different eigenstates; to introduce a second 
particle would involve beginning the problem over again with double wave 
vectors. In secondary quantum theory the states are considered to be 
occupied by 16 different partially existing particles, which in special 
cases may be definitely present or absent. We have therefore to assign 
to them existence operators J r (r=l, 2, ... 16) stating the degree of 
occupation. 

The expression (16*52) singles out a particular plane J5 23 as the plane of 
spin, and this choice will be reflected in the value of the eigenvector . The 
choice might equally well have been E 3l or E^ . This will be investigated in 
greater detail in the next section; but it is fairly evident that the effect of 
these alternatives will be to triplicate the set of independent wave functions, 
making the total number 48. This number is again doubled when we allow 
for two possible algebraic frames (1, i), or an equivalent duplexity. The 
total then becomes 96 or JV^. 

We now begin to see the make-up of the numbers N l9 N 2 , N. The 
elementary states correspond to wave functions of the form e** (:Md ^ . 
The factor 2 w2 corresponds to the variety of the exponential; and the factor 
n(n + 1)/2 corresponds to the variety of ^r ; finally there is a factor 2 corre- 
sponding to the duplexity of the algebraic frame. We notice that in N l9 the 
four dynamical coordinates give a factor 2 4 , whereas the three geometrical 
coordinates give a factor 3. This is because the wave function depends on a 
combination of eigenvalues of the dynamical momenta and a choice of 
geometrical axis. Similarly in N 2 and N^ the large factor arises from the 
number of combinations of eigenvalues of operators with a common eigen- 
symbol, and the small factor represents a choice of "orientation" of this 
eigensymbol. 

The final doubling of the number is the most difficult step to investigate. 
It is fairly obvious that the duplexity of the algebraic frame will contribute 
a factor 2; but it is less easy to be sure that the factor has not already been 
included in the enumeration. 

Before studying the method of enumeration more rigorously, we must 
make clear the connection between the existence of N eigenfunctions of 



16-5] The Exclusion Principle 319 

type (16*52) and the formation of a corresponding universe of N particles. 
In Chapter xrv the particles constituting the Einstein universe were 
represented by wave functions in a flat space, formed by stereographic 
projection of the "actual" spherical space. Here we begin at the other 
end of the problem; for the wave functions (16*52) are those occurring 
in elementary quantum theory, which, as we have seen (14*3), pre- 
supposes an Einstein universe as background. These wave functions 
occupy actual space; and the proper energy of the N l particles, being 
represented by the curvature of space, cannot be represented a second 
time in the wave functions themselves. By the dynamical theory these 
eigenfunctions give the only possible steady states of disturbance of the 
system; there are therefore just N independent steady modifications 
(generalised states of rotation). But, owing to the symmetry of the con- 
ditions, there is a degeneracy which permits us to re-analyse their most 
general combination into various alternative sets of N^ elementary states ; 
in particular we can analyse it into an equivalent number of spherical 
harmonic distributions. By projecting these into a flat space, we restore 
to the wave functions the proper energy which was abstracted to provide 
the curvature, and obtain wave functions corresponding to those of 
Chapter xiv. The latter, by definition, correspond to steady states in 
the self-consistent field produced by the whole aggregation, so that they 
must conform to this method of enumeration. 

Lemaitre has pointed out to me that the ambiguity <E 16 <x 4 in (16-52) 
seems to be in conflict with (8*54), where the algebraic dynamical co- 
ordinate was given unique sign. But this ambiguity is seen to be necessary, 
when we remember that the set of wave functions, when fully occupied, 
must not provide a resultant proper energy additional to that represented 
by the curvature of the space to which they are referred. In primary 
quantum theory reversal of the algebraic coordinate would turn the 
particle into a minus-particle; and here the double sign will imply that 
in some sense the 96 composite particles can be regarded as 48 plus- 
particles and 48 minus-particles though we must not too hastily assume 
that the latter are positrons and negatrons. Since we are treating a set 
of particles whose resultant energy tensor provides the curvature of the 
space in which their individual wave functions are represented, the con- 
tributions of these wave functions to the energy tensor must cancel out, 
and there must be as many minus-particles as plus-particles in the set. 
It is therefore right to include the minus-particles in the enumeration. 
This relative representation provides the simplest method of counting the 
independent wave functions; but to obtain the particles ordinarily re- 
cognised in physics we must restore to them the energy abstracted to 
form the curvature. By this transformation they all become plus-particles, 



320 Physical Applications [16-5 

as in 14-2. To put the conclusion in another form -/'-elementary quantum 
theory always describes its systems as additions to the impermeable 
background which constitutes the Einstein universe. If therefore we 
describe by the wave functions of elementary quantum theory the set 
of particles which composes the impermeable background, the plus- 
additions and minus-additions must balance. But the minus aspect is only 
relative, and the particles are all positive contributors to the total energy. 

16*6. Geometrical Representation of J r . 
By squaring (16-52) we obtain the strain vector of index 2 

F(a l9 a a , a a , a 4 ) = e l ' a i^ a ia 3^4^, (16-611) 

the matrices being eliminated because V Q is an eigensymbol. Since ^ = <A ^o> 
it has the pure form F = J (i + j (i + tf w) . (16-612) 

We can distinguish the possible functions V as V r (u,v), the suffix r 
(r= 1, 2, ... 16) indicating a particular combination of signs in (16-611) and 
the arguments u, v indicating planes in the two Pauli frames. 

Consider the existence operator J of V. J is represented in a Pauli frame; 
and since there is a possibility that V may be an eigensymbol of J, i.e. that 
/F = V, this must be one of the two frames of V. Thus the and frames 
have the fundamental distinction that one of them (namely the spin frame ) 
is also the frame of the existence operator. To interpret this physically we 
recall that J is the world factor in the energy tensor JV ( 16*4). In the 
conditions represented by a simple wave tensor the "rest of the universe" 
is understood to form a uniform static distribution defining an electrically 
neutral three-dimensional space. On the other hand V involves four dimen- 
sions and is not electrically neutral; thus it contains an additional factor 
(1 + iO v ) whose plane v is not definable by reference to the three-dimensional 
space of J. 

Let J W =%(1 +i w ) be the existence operator of a particle w . By (16-612) 
J W F = F if =,, and J W F = if = -. What we have hitherto called 
the degree of existence of the particle w in the system F, is reduced to a 
directional relation between the spin planes w , M of e^and F. This simplifica- 
tion is due to the fact that F is a pure strain vector, and could itself be 
regarded as representing an elementary particle u with an existence operator 
J u . In our original formulae ( 16-1) F was not restricted to be a pure strain 
vector; it might therefore have a constituent agreeing with e w in direction 
but reduced in intensity by a numerical probability factor. This mode of 
partial existence of w in F is now excluded by the condition of purity. 
Accordingly the degree of failure of one elementary particle or wave system 
to exist fully in another elementary wave system is represented exclusively 
by angular deviation, and not by fractional probability. 



16-7] The Exclusion Principle 321 

To enumerate the independent eigenfunctions, we must replace V r (u, v), 
which has a continuously varying parameter u, by an appropriate number 
of functions V r (u a , v) with fixed parameters u a ^ The continuity of u is a 
degeneracy due to the symmetrical conditions postulated, which permit 
relativity rotation in three dimensions (cf. 13*6). To obtain a corresponding 
non-degenerate problem, we should limit the spin to three coordinate planes. 
Since the spin can be in either direction in each plane, this gives six alter- 
natives. We conclude therefore that the 16 eigenfunctions V r (u,v) have a 
sixfold degeneracy, and correspond to 96 non-degenerate eigenfunctions. 

The conclusion that the degeneracy is sixfold, not threefold, can be 
reached in another way. Pauli space is not an ordinary space of three 
dimensions. The Pauli matrices ^, f^, f correspond to JE? 23 , E^, E 31 , which 
have a different group relation from the matrices E ,E 29 E B which represent 
rectangular axes in ordinary space. Suppose that rectangular axes Ox, Oy, 
Oz are laid down geometrically in a three-dimensional space. Taking it to 
be an ordinary space, we associate matrices E l , E 2 , E 3 with Ox, Oy, Oz, 
respectively; in so doing we do not give any chiral quality to the space. 
Taking it to be a Pauli space, we associate Pauli matrices A , f^, with 
Ox, Oy, Oz, respectively; these satisfy ^ v = x = ?,* (3-86). But we 
can also associate with the same geometrical axes an alternative (left- 
handed) Pauli space by using the matrices in the order ^, f v , i^. Treating 
tt as a vector referred to rectangular axes, V r (u, v) will have a threefold 
degeneracy; but there will also be a duplicate set of 48 eigenfunctions 
whose existence operators (or spins) are represented in the alternative 
Pauli space. 

It might be suggested that all possible reversals have already been 
provided for by the signs in (16*52). But a consideration of the hydrogen 
atom makes it clear that the reversal of spin is additional. The orbital motion 
of an electron in any plane may be in either direction. In addition, two 
states are discriminated according as the electron spin is with or against 
the orbital motion. From this aspect the factor 2 arises from the duplexity 
indicated by the "fourth quantum number". 

16*7. Double and Quadruple Wave Functions. 

The enumeration of independent double wave functions proceeds on the 
same lines as the enumeration of simple wave functions. The maximum 
number of commuting symbols E^F V is 16, obtained by combining an anti- 
tetrad of E -symbols with an antitetrad of JT-symbols. We have therefore 16 

t The direction of v in its 3-space does not matter. There is no reference standard to 
compare it with; and, whatever the direction may be, it is called the time. (It should be 
borne in mind that the physical ideas illustrated by a universe composed of simple wave 
functions are necessarily somewhat crude.) 

ETP 21 



322 Physical Applications [16-7 

dynamical coordinates a^, and the double strain /ector function for a 
steady state, corresponding to (16-611), is 



where JJ> is a common eigensymbol of the 16 commuting JS^-symbols. The 
sixteenfold ambiguity provides 2 16 combinations. 

First suppose that V is the product of two simple strain vectors. Then in 
place of (16'612) we have 



An existence operator for a system of two particles is of the form 



since the system does not exist unless both the particles exist. The two Pauli 
frames f , ' will combine into a frame of fourfold matrices JSy/, and the 
frames 6, ff into another frame JJ/.f Then K is a simple strain vector in 
the W -frame and V is a double strain vector in the E'F'-fmmQ. 

The strain vector V is by definition a perfect square V F Y , but in general 
Y is not the product of two simple wave vectors. The system represented 
by a double wave tensor is a superposition with different probabilities of a 
number of resolvable double systems. The essential result for our purposes 
is that the geometrical relations, which replace the conception of partial 
existence, are contained entirely in the JP'-frame. Rotation in the JF'-frame 
can be ignored, since there is no reference standard in that frame. 

We have then a series of wave functions lj([im'], [vv']) 9 r= 1, 2, ... 2 16 , 
involving a continuous parameter [uu']; and we have to determine the 
degree of degeneracy implied by the continuous parameter. If V r represented 
two completely separate particles, [uu f ] would represent their planes of spin 
in three dimensions. Since the particles are not separate [uu f ] is a simple 
strain vector in an ordinary fourfold frame. This brings us to familiar 
ground. Although the domain in which we picture this vector is not ordinary 
space-time, the problem is the same mathematically. The relativity rota- 
tions, whose existence brings about the degeneracy, are the 10 rotations in 
five dimensions or the ten displacements in phase space, which we have 
studied fully in the earlier chapters. There is accordingly a tenfold degen- 
eracy; or, including the duplication due to right- and left-handed frames of 
E' each V r is equivalent to 20 non-degenerate wave functions. This gives 
the total # 2 =2. 10. 2 16 . 

It is perhaps not obvious that the number of rotations to be taken into 
account is 10 not 16. We might appeal to the formal argument that the 
corresponding factors in N^ and N 2 should be either 3, 10 or 4, 16; and since 
the factor in JVi is certainly 3, the factor in N 2 must be 10. But the result is 

t This regrouping has a resemblance to the crossing of frames in 10*4; but it is not the 
same transformation. 



16-8] The Exclusion Principle 323 

verified at once, whence notice that in comparing our double system with 
a standard system of two particles defined by the existence operator K, we 
cannot distinguish between the combination J W J W ! and J^J w f > Thus our 
reference system does not distinguish between a rotation t,^ v f and ,,/, 
and the rotations (^ ' ,,/ must be ignored in the same way that we 
ignore rotations in the 00' frame. Adapting the theory of crossed frames in 
10-6 to twofold matrices, the excluded rotations are the antisymuietrical 
(tinje-like) rotations in the crossed frame; there remain the ten symmetrical 
(space-like) rotations. 

The extension to quadruple wave functions does not raise any new point. 
The reference system of four separate particles defined by a quadruple 
existence operator has a l<J6-dimensional phase space, so that the degeneracy 
is 136-fold. 

In each case the degeneracy of the wave function corresponds to the phase 
space of a tensor of half its rank. A corresponding reduction occurs in the 
theory of the steady internal states of the hydrogen atom, which can quite 
well be (and commonly are) treated by Pauli vectors instead of Dirac 
vectors; but any deviation from the steady state involves the use of Dirac 
vectors. 

16*8. Four -point Elements of Structure. 

We can observe a relation between two physical entities. To mwtsure a relation 
we must compare it with another relation of the same kind. Thus a measure 
is a relation between two relations and involves four entities.! Formally a 
measurement always refers to just four relata, viz. the terminals of the 
object relation and the comparison relation. But in practice systems of 
combination of measurements have been elaborated which enable us to 
attribute measure indirectly to more complex networks of relationship. 

The basis of measurement is therefore a four-point element of world 
structure. It is on this principle that I have developed the generalised field 
theory in Mathematical Theory of Relativity. % Ultimately the theory of 
atomicity springs from the same origin. 

In the field theory it was necessary to couple with the four-point relation 
a condition of affine geometry, viz. that for infinitesimal relations, if the 
relation AB is equivalent to the relation CD 9 then the relation AC is equi- 
valent to the relation BD. More light is now thrown on this axiom by the 
transformation theory. I think that in attempting to compare or describe 
the difference between two objects, we are impelled to envisage a chain of 
intermediate objects to see, as it were, one gradually changing into 
another. If such a transformation is too far-fetched to be considered, we 

t In special cases two of the entities may coalesce. 
J Chapter vn, Pt. II, especially 98. 



324 Physical Applications [16-8 

are nonplussed, as when asked (in the familiar riddle^the difference between 
an orange and a grand piano. Whether or not this is true of all relations, we 
do in physics confine attention to relations which are represented by trans- 
formations. The axiom of affine geometry then reduces to the condition that 
infinitesimal transformations commute as far as terms of the first order. 

From this starting point we deduce the existence of a measure of world 
structure at each point, given by an affine curvature tensor *jB ftw , which on 
contraction yields a metrical tensor g^ v and an electromagnetic tensor F^. 
The tensor g^ v provides a Riemannian geometry, and with this we define a 
new indirect measure system. A measure is still a relation between relations; 
but we now distinguish sharply between the object relation and the com- 
parison relation. The latter is standardised; anc^ it is no longer a relation 
between specific entities, but is vaguely "contained" in the Riemannian 
geometry. The measure is transferred verbally (and usually also in con- 
ception) to the object relation, so that it appears to involve only two entities. 
Finally in the non-relativistic mode of thought which dominates current 
quantum theory (whether styling itself relativistic or otherwise) the measure 
becomes transferred to one terminal of the object relation, a geometrical 
concept (e.g. an origin of coordinates) being substituted for the other 
terminal. By this devious route we arrive at entities supposed to be endowed 
with measurable properties, e.g. an electron endowed with charge and mass, 
although it requires four entities to furnish anything measurable. 

In wave mechanics a distinction is drawn between observables and 
unobservables. It will here be clearer to call them measurables and un- 
measurables. Current wave mechanics attributes measurables (momenta, 
coordinates, spins, etc.) to single entities such as an electron. The measur- 
ables are, however, four-point relations, which become attached to the 
electron because the other three relata are standardised. In the method of 
wave mechanics measurables are expectation values or eigenvalues derived 
from wave functions; and in attaching the measurables to the electron we 
attach also the wave functions which contain them. Primitively the wave 
function of an electron is that of a four-point element of structure. 

The wave function is quadruple, since it specifies a quadruple probability 
distribution of four entities. We have found that the number of independent 
quadruple eigenfunctions is N = 2 . 136 . 2 256 . It is well known that (with due 
allowance for degeneracy when it occurs) the number of eigenfunctions 
persists in all transformations. By the changes in plan of measurement, to 
which we have referred, the quadruple wave functions become replaced by 
double and finally by single wave functions. Our position has been that, for 
better or worse, current theory has made this substitution; and since it is 
implied in all standard nomenclature, we must accept it. We have seen that 
certain previously inexplicable features in current physics, such as the mass- 



16-8] The Exclusion Principle 325 

ratio of the proton and electron, are impositions, which we are forced to 
accept in order to validate the substitution. Another such result is that by 
substituting N^ simple wave functions for the N quadruple wave functions, 
2 . 136 . 2 266 simple wave functions are forced into a space-time which is built 
to accommodate 96. The result of this overcrowding is that instead of their 
wave lengths being naturally adjusted to the curvature of space, they occupy 
spherical harmonics of high order and short wave length as explained in 
Chapter xiv. 

Our result, that the "number of particles in the universe" is the number 
of independent non-degenerate eigenfunctions in a quadruple wave system, 
thus comes from the fact that a measurement, or comparison of relation with 
relation, is an expectation value determined by a quadruple wave system. 

The number of dimensions of space-time can be regarded as one of the 
numerical constants of nature. Effectively, the number 4 was assumed at 
the beginning of our theory, when we chose a symbolic frame based on 4 
anticommuting square roots of 1. But we can now determine it in the 
same way as the other natural constants, and show a fundamental reason 
why space-time has 4 dimensions and no more. 

Space-time may be defined as the continuum in which we represent a 
certain type of physical relation called "displacement". The relation 
connects two relata which are conceived geometrically as points "having 
no parts and no magnitude". We need not add that they have position. 
Their relative position is expressed by the relation of displacement already 
mentioned; absolute position is denied. To pass from this geometrical abstrac- 
tion to a physical universe, we must turn the points into physical entities; 
so that "displacement" is now a relation between entities occupying the 
points. These entities are represented by simple (commuting) existence 
operators, which indicate whether in a given operand (representing a state 
of the universe) the points are occupied or not. Nothing more is required; 
for, so far as the relation of displacement is concerned, the only property 
of the entity is that it occupies the terminal of the relation. The simple 
existence operators J, J r give a combined existence operator J J', which will 
also be the existence operator of the displacement; for the relation will not 
exist unless both the relata exist. 

We have now to form a continuum of displacements. This is constructed 
by forming a Group of operators, which includes all operators of the form 
JJ'. It is not necessary that the Group should consist of J J' operators 
alone; but the JJ 9 operators will be distinguished as the pure operators 
of the Group.f It is simplest to consider the matrix representation of the 

t In a Group the transformations consist of the same operators as the field. It would be 
contrary to the ordinary conceptions of transformation theory to limit the transformations 
to pure operators. Consequently the field must be extended to include impure operators. 



326 Physical Applications [16-8 

operators. Since J has two eigenvalues, it can be represented as a twofold 
matrix. Then JJ' is represented by a fourfold matrix, or equivalently by 
an JS?-number. The required Group is therefore that of the ^-numbers. 

This is the simplest solution; but we must also show that more complex 
solutions are excluded. For example, J might be represented by an idem- 
potent fourfold matrix; but such a matrix has been shown to be of the form 
%(l + i). (1 + id), i.e. the product of two idempotent twofold matrices. 
Since J is the product of two simpler existence operators, the entity defined 
by it can be analysed into two parts either of which can exist without the 
other. This case is excluded because it is implied in the conception of dis- 
placement that the entities related by it are like points "having no parts". 
To represent J by a matrix of order higher than the minimum order 2 
implies that the entity defined by it is something more than an "occupied 
point". 

It being established that displacement is represented by an J?-number, 
we can proceed to develop the theory of space as in Chapters TV and vi. This 
gives the result that space-time is a curved four-dimensional continuum, 
and that its signature is 3 + 1. 

16*9. Reaction on Macroscopic Theory. 

For the most part there is no occasion to modify the existing macroscopic 
relativity theory ,f which I believe to be correct so far as it goes (except for 
the important amendment explained in 8*8). The following are the chief 
points that claim attention when it is reviewed in the light of the present 
results : 

(1) There would seem to be a doubt whether Einstein's theory holds 
rigorously for a rotating macroscopic body, e.g. a star. % The whole subject 
is very obscure, but I think the question can be put in the following form. 
Doubtless the particles in a star have attained a statistically steady dis- 
tribution of spin, such that as many spin in one direction as in the opposite. 
But is this balance of spin relative to (a) axes rotating with the material, or 
(b) Galilean axes ? Since the balance is brought about by interactions between 
the particles, either directly or with radiation as intermediary, I should 
conjecture that the answer is (a). If so we have, relative to the Galilean axes, 
a distribution of sources (amounting in the aggregate to a macroscopically 
appreciable source) for which Einstein's theory provides no notation. If the 
answer is (6), the difficulty does not arise. 

t For definiteness, this is understood to be the theory set forth in my Mathematical Theory 
of Relativity, except that important advances have since been made in the cosmical problem. 
Chapter v should therefore be supplemented by B. C. Tolman's Relativity, Thermodynamics 
and Cosmology, pp. 331-488. 

I Doubt of a similar kind has been raised by Sir J. Larmor (Nature, 137, 271 (1036)). 



16-95] The Exclusion Principle 327 

Assuming (a), the C|tse appears to be one in which it is necessary to use the 
Riemann-Christoffel matrix ( 11 -4). The effect, if any, can scarcely be large 
enough to be of macroscopic importance. Its interest lies in its bearing on 
the conservation of energy, since it suggests the existence of a new type of 
field which, if it is not taken into account, would render even the macro- 
scopic conservation of energy imperfect. 

(2) Since g^ v and K O denote measurable properties of macroscopic fields, 
they are expectation values determined by the wave functions of a great 
number of individual particles. We may indicate this average character 



explicitly by the notation g^ v , K a . In general g tLV K ^g VLV K a9 so that in the 
generalised field theory it is represented by a separate symbol ^ Vf<7 .t 
The generalised theory reduces to Weyl's theory if K tlVta ^g VkV K a . Weyl's 
theory is therefore the approximation obtained by neglecting the difference 
between the average of a product and the product of averages. From a 
practical point of view the correction is trivial; but the question of practical 
application scarcely arose in these theories, whose aim was to gain insight 
into the foundations of world-structure. The point brought out in the general 
theory, which is not seen in Weyl's theory, is that the law of gravitation is 
not an additional limitation but is simply the gauging equation; this is one 
of the essential links in connecting the field theory with the microscopic 
theory developed in this book. 

As it happens, we have employed Weyl's theory in a practical way on 
several occasions (12-7, 12-8). To study the electromagnetic field we 
create an artificial field of electromagnetic potential by a gauge transforma- 
tion; just as in Einstein's theory we study the gravitational field by creating 
an artificial field of gravitational potential by a coordinate transformation. 

16-95. Philosophical Outlook. 

We conclude with a brief reference to the philosophical position towards 
which the present results trend. Unless the structure of the nucleus has a 
surprise in store for us, the conclusion seems plain there is nothing in the 
whole system of laws of physics that cannot be deduced unambiguously 
from epistemological considerations. An intelligence, unacquainted with 
our universe, but acquainted with the system of thought by which the 
human mind interprets to itself the content of its sensory experience, should 
be able to attain all the knowledge of physics that we have attained by 
experiment. He would not deduce the particular events and objects of our 
experience, but he would deduce the generalisations we have based on 
them. For example, he would infer the existence and properties of radium, 
but not the dimensions of the earth. 

The mind which tried to apprehend simultaneously the complexity of the 
t Mathematical Theory of Relativity, 93. 



328 Physical Applications [16-95 

universe would be overwhelmed. Experience must be dealt with in bits; 
then a system must be devised for re-connecting the bits; and so on. One 
outcome of this treatment is that the universe is passed through a sieve 
with 3.10 79 holes to render it more comprehensible. In the end what we 
comprehend about the universe is precisely that which we put into the 
universe to make it comprehensible. 

So far as I can trace, the earliest sign of uneasiness among physicists 
about this procedure was shown in connection with the analysis of white 
light by a grating. The analysis of white light by a prism had been looked on 
as a discovery of its composite nature. When the same thing was done with 
a grating, it was seen that we were openly manufacturing the periodicities 
we thought we had discovered in it. As to the composite nature of white 
light, all that can be said is that if a mathematician chooses to analyse a 
function into Fourier components he is at liberty to do so. 

The theory of relativity drew much fuller attention to the subjective 
aspect of many of the laws of physics. The Lorentz transformation comprises 
a number of laws which seem to describe properties of the natural objects 
which surround us (e.g. change of mass with velocity, FitzGerald con- 
traction). But we put these properties into the objects because it is our 
habit to refer everything-to a reference frame in which space and time are 
separated, although there is no such separation to be found in nature. 
Passing to a more advanced illustration of subjective influence, we have 
seen ( 13-4) that the principle of least action arises because (as part of our 
system of comprehending experience in bits) we feel the need to localise the 
various measures which we employ. Localisation is an artificial concept in 
an interrelated universe; and indeed in elementary wave mechanics the 
conception of energy and momentum is primarily introduced as an attribute 
of infinite plane waves, spread over the whole universe. 

In microscopic physics the question of how much we discover and how 
much we manufacture becomes still more acute. We cannot observe a 
microscopic particle without grossly interfering with it. It is often said that 
the particles are put into particular states by the type of experiment we 
perform on them. That is scarcely a fair view of the nature of the inter- 
ference by the observer. Ideally he might wait until the conditions of the 
experiment reproduced themselves naturally. His interference is selective 
rather than active. But so far as physical theory is concerned, it makes 
little difference whether the observer selects the state, or puts the system 
into the state, which he "discovers". Thus before enumerating the cha- 
racteristics of an elementary particle, we have to indicate the type of 
observation by which its existence is supposed to have been recognised. 
Was it the momentum or the position that was noticed? Or was it inferred 
as one of n particles whose total momentum and mean position were found ? 



16-95] The Exclusion Principle 329 

Thus at various stagey in this book we have had to distinguish specified, 
unspecified, and macroscopically specified particles, internal and external 
particles, neutral and vector particles and so on. Their diverse properties 
are the result of the different varieties of observational interference (active 
or selective) which precede our recognition of them. At the beginning we 
treated abstract elementary particles supposed to have been the subject 
of absolute measurements of position and momentum. In a sense these 
particles were all protons or electrons. But only when we recognised that 
actual measurements of momentum and position are relative, did we reach 
particles having the characteristic mass and charge of the proton and 
electron. 

Fifteen years ago I was^responsible for an oft-quoted remark,! tfc It is one 
thing for the human mind to extract from the phenomena of nature the 
laws which it has itself put into them; it may be a far harder thing to extract 
laws over which it has had no control. It is even possible that laws which 
have not their origin in the mind may be irrational, and we can never succeed 
in formulating them." This seems to be coming true, though not in the way 
that then suggested itself. Laws of atomicity have since been discovered, 
and have turned out to be rational and comprehensible to the mind; but it 
turns out also that they have been imposed by the mind in the same way as 
the other rational laws. But a new situation has arisen, because we now 
recognise that the totality of (mind-made) law does not impose determinism. 
Room is left within the scheme of physical law for undetermined behaviour. 
Behaviour whose laws are irrational was perhaps as near to the conception 
of undetermined behaviour as the thought of the time could reach. 

The physicist might be likened to a scientific Procrustes, whose anthro- 
pological studies of the stature of travellers reveal the dimensions of the bed 
in which he has compelled them to sleep. Yet I do not think that we take 
unwarrantable liberties with the universe in our Procrustean treatment of it. 
If experience is a subject-object relation, the subject is entitled to nay, 
he cannot divest himself of his half-share. It can scarcely be a coincidence 
that Heisenberg's uncertainty principle has defined the half-way line with 
mathematical exactitude, distributing a coordinate to one side and a 
momentum to the other side with perfect impartiality. And so we may 
look forward with undiminished enthusiasm to learning in the coming 
years what lies hidden in the atomic nucleus even though we suspect that 
it is hidden there by ourselves. 

t Space Time and Gravitation, p. 200. 



INDEX 



A priori probability, 107, 198 
Absolute and relative displacement, 109, 183 
Action invariant, 136, 243 
Action principle, 238, 240 
Added system, 262, 266, 268 
Addition, non-commutative, 154; general- 
ised, 309 

Additive characteristics, 154, 309, 315 
Affine geometry, 183, 323 
Algebraic coordinate in phase space, 106, 

216, 220 

Algebraic frame, 48 

Algebraic function, 125; wave function, 120 
Algebraic number, 20 
Algebraic wave tensor, 89, 248 
Angular momentum, 122; dynamical, 123; 

quantisation of, 147; macroscopic, 248, 

326. See also Spin 
Antedating, 111, 224 
Anticommuting symbols, 21, 51, 312 
Antiperpendicular rotation, 51, 94 
Antisymmetrical matrices, 40, 96; double 

matrices, 159, 166 

Antisymmetrical tensors, 54, 57, 58, 181 
Antisymmetrical wave functions, 167, 288, 

290 

Antitetrad, 23, 69, 128 
Antithetic, 75 
Antitriad, 23, 69, 128 
Associated strain vector and space vector, 

103 

Asterisk notation, 16 
Augmented phase space, 217, 219 
Average particles, 234 

Basis of statistics, 198, 216, 281, 286 
"Blank sheet", 32, 40, 56, 107, 109, 287 
Bond, W. K, 272, 273, 303 
Born, M., 151 
Born-Infield theory, 136 
Box problem, 235 

Cardinal operator, 312 

CD frame, 161, 167, 203 

Chain multiplication, 16, 155 

Chandrasekhar, 8., 235 

Characteristic equation, 44 

Charge (electric), sign of, 49, 80, 88, 168; 

association with mass, 223; origin of, 283; 

value of e, 304 
Charge-current vector, 90. See Stream 

vector 



Clifford's numbers, 22 

Closure of space, 78, 83, 97 

Cogrodient, 14 

Collision interchange, 245 

Combined systems, 110, 158, 215 

Commutation rules, Jordan- Wignor, 312; 

Einstein-Bose, 314 
Commuting operators (W, (\, (7 a , Z7 3 ), 128, 

141 ; for hydrogen, 144 
Compact .AJ-numbers, 72, 101 
Comparison fluid, 180; partial, 196; neutral, 

206; indistinguishable), 294 
Complementary stream vector, 311 
Complete energy tensor, 163 
Complete momentum vector, 65 
Complete sots, 22, 35, 47, 156; transforma- 
tion of, 27, 160 
Complete space vector, 54; constitution of, 

57 

Complete stream vector, 65, 73 
Complex conjugate wave functions, 131 
Component of an ^-number, 23; formula for, 

37; in double frames, 157 
Composite individuality, 234, 285, 307, 315 
Configuration, 95 
Conjugate triads, 23, 42, 47, 160 
Conservation of probability, 115, 120; of 

energy and momentum, 229, 327 
Constant of gravitation, 273 
Constants of nature, 3; revision of, 304 
Continuous and discrete wave functions, 213, 

226 

Contracting universe, 279 
Contraction of matrix (spur), 36; of double 

matrix, 156; of wave tensor, 206; of 

volume element, 213 
Contragrodiont, 14 
Contravariant, 15 

Cosmical constant (A), 3, 188, 194, 279; in- 
determinacy of, 229, 238, 252 
Cosmical Riemann-Christoffei tensor, 202, 210 
Cosmical system (system .4), 266 
Coulomb energy, 281, 297, 301 
Coupling of spin, 228 
Covariant derivative, 121; with respect to 

X, 289, 300 
Covariant vector, 15; wave tensor, 19, 94, 

109 

Crossed frames, 161 ; significance of, 162 
Crystal grating, 306 
Curvature of space-time, 5, 55, 82, 126, 203, 

256 



332 

Curvature tensor, gauge-invariant, 251 
Cylindrical curvature, 220 



Index 



^-symbols, 125, 317 

Darwin, C. G., 1 

de Sitter space-time, 5, 182, 192, 202, 207, 
211, 264 

Debye-Hiickel effect, 139 

Deflection value of e/m, 304 

Degeneracy formula, 234, 253 

Degeneracy of systems and oigenfunctions, 
5, 247, 292, 321 

Degenerate ^-number, 23 

Description, systems of, 13, 14, 117 

Determinant of an J^-number, 42, 107 

Differential wave equation, 119; strain 
vector form of, 130 

Diffraction of electrons, 305 

Dirac, P. A. M., 1, 6, 15, 36, 62, 90, 115 

Dirac's theory of the positron, 265 

Dirac' s wave equation, 63, 87, 106; dif- 
ferential equation, 119, 130, 144, 151 

Discrete and continuous wave functions, 213, 
226 

Displacement vector, 15 

Displacements, existence operators of, 325 

Divergence operators, 118 

Double frame, 155; wave vector, 155; phase 
space, 166; existence operator, 322, 325 

Double tensors, importance of, 169, 181 ; re- 
duction to simple tensors, 207, 212 

Doublc-valuedness of wave function, 60, 
151 

du Val, P., 36 

Dual of a tensor, 162 

Dual B.C. tensor, 192 

Dynamical coordinates, 116, 123, 127, 140 

Dynamical equations, 126, 128, 144, 213; for 
strain vector, 136 

^-numbers, 21 ; matrix representation (given 

in full), 42 
^-symbols, 22; line as independence of, 24; 

representation by matrices, 36, 39 
Eigenfunctions, enumeration of independent, 

318; double and quadruple, 321 
Eigensymbols and eigenvalues, 44 
Einstein, A., 1 
Einstein-Bose particles, 312 
Einstein space-time, 5, 195, 211, 264 
Einstein universe, 192, 256, 258, 278; 

metastability of, 277 
Electric charge. See Charge 
Electric moment, 89 
Electrical energy tensor, 189, 194 
Electrical matrices, 84; rotations, 84 



Electromagnetic potential (* tt ), 119; gauge 
transformation of, 134; origin of, 137; 
macroscopic character of, 139 

Electron-point, 140 

Electron pressure, minimum, 234, 258 

Elementary particle, 74 

Energy, conservation of, 229, 327; furnished 
by specification, 197; threshold, 262 

Energy invariants, 221, 260, 267 

Energy levels of hydrogen, 147, 303; ne- 
gative levels, 265 

Energy operator (T a3 ), 230, 250 

Energy tensor, 91, 162, 169, 188, 229, 315 

Environment, 32; standard, 129, 143, 150, 
180 

Epistemologv, 5, 328 

Equipartition of energy, 221, 254, 268 

Equivalence of frames, 29, 32; of points in 
space- time, 56, 77; of volumes, 99 

Exact energy tensor, 164 

Exclusion principle, 231, 233, 253; limited 
to steady systems, 246, 254; in secondary 
quantum theory, 315 

Existence, degree of, 309, 320 

Existence operators, 308; double, 317, 322; 
geometrical representation, 320 

Expanding universe, 225, 273, 279 

Expectation value, 38, 149; of spin mo- 
mentum, 122; of energy operator, 231, 
257 

Exponential, non-algebraic, 50 

Extended wave function, 284 

External and internal states, 92 

Factorisation of wave tensors, 66; conditions 
for, 69, 70; in secondary quantum theory, 
310, 314 

Factors, numerical, (J JJ), 10, 302, 304, 307; 
(*), 227, 258, 276; (), 259, 276, 278; 
(2), 143, 152, 261, 274/277, 318, 319 

Factors, symbolic, 37 

Fermi-Dirac statistics, 282; law of anti- 
symmetry, 290; particles, 312 

Field theories. See under Generalised, Uni- 
fied and WeyV* theory 

Final wave vectors, 16, 18 

Fine structure constant, 3, 153, 295, 298 

Five-dimensional relativity, 190 

Five-dimensional space vectors, 63 

Flat space-time, 5 

Four dimensions, space vectors in, 55; reason 
for, 3, 325 

Four-point elements of structure, 170, 323 

Four-point matrices, 34, 38, 96 

Fowler, R. H., 236 

Frame, geometrical, 179 



Index 



333 



Frame, symbolic, 29; right- and left-handed, 
48; neutral and macroscopic, 84; double, 
155; crossed, 161; of identity, 291 

^riedman-Lemaitre theory, 279 

Frobenius's theorem, 46 

Gauge-invariant curvature tensor, 190, 251 
Gauge transformation, 135, 223, 226, 295; 

non-integrable, 137 ; natural gauge, 136 
Generalised field theory, 5, 85, 190, 295, 324, 

327 ; amendment of, 136 
Gravitation, constant of, 3, 273; law of, 

136. 

Gravitational potential energy, 264 
Ground state of the universe, 256, 259, 279 
Group property, 13 

Hamiltonian, 63, 129; reality conditions for, 
87; strain vector form, 106; for internal 
coordinates, 175; for standing waves, 237 

Heisenberg, W., 1. See Uncertainty principle 

Helium, packing ratio in, 10 

Hermitic conditions, 75, 131, 152 

Homothetic, 75 

"Hybrid" particles, 201 

Hydrogen atom, wave equation of, 145; in 
practical units, 153; interaction in, 291; 
evaluation of Coulomb term, 298 

Hyperbolic rotation, 76, 97, 217 

Idempotent symbols, 70; physical import- 
ance of, 91 

Identity, the fundamental, 67; generalisa- 
tion of, 187 

Identity (physical), relativity of, 290; com- 
posite, 234, 285, 307, 315 

Imaginary gauge- transformation, 135 

Imaginary matrices, 40 

Index of wave tensor, 131, 152, 274, 304 

Indistinguishable particles, 282; dynamics 
of, 285; comparison fluid, 293; protons 
indistinguishable from electrons, 287 

Inhibited rotation, 110 

Initial probability distribution, 107, 11)8, 
216, 281, 286 

Initial wave vectors, 16, 18 

Inner product, 15; notation for, 17 

Integrated energy tensor, 253 

Interaction, 281; of system with environ- 
ment, 112, 116, 129; at boundary, 249; in 
hydrogen atom, 291; in general systems, 
154, 298 

Interchange, continuous, 283 

Interchange coordinate. See Permutation 
coordinate 

Interchange energy, 245, 283, 293 



Interchange operator, 158; factors of, 159; 

for double frames, 160; associated with 

Coulomb energy, 301 
Interlocked transformations, 54, 109 
Internal and external states, 92 
Interval, 75; measurement of, 81, 242 
Invariant time (s), 116, 129; represented by 

algebraic coordinate, 106, 216 
Inverted cross, 161, 164 
Irrational laws, 329 
Irregular metric, 211 
Irrevorsibility of time, 117, 225 

Jordan, P., 151 

Jordan- Wigner wave functions, 169, 311 

A' state of a system, 259, 269 
Kinematical energy tensor, 189 
Kinomatical matrices, 84; rotations, 84 
Kinetic energy, distribution law, 246 
Kummer collincation group, 36 

Left-handed frames, 48, 81, 85, 172 

Lemaitre, G., 47, 279, 319 

Light-time, 111, 129, 218 

Limit state, 259 

Linear transformations, 13 

Linkage of system to surroundings, 32, 111, 
117, 129, 292 

Linked rotations, 182, 187; translations, 189; 
displacement in time, 187, 192, 201 

Littlewood, D. K, 71 

Local orthogonal coordinates, 98 

Local system (system /?), 266 

Localisation of characteristics, 240 

Lorentz transformations, 76, 116, 249; in- 
applicable to internal state, 92, 96; to 
Einstein world, 211; to exclusion prin- 
ciple, 231; to non-transferable energy, 
254 

MacDuffee, C. C., 7 

Macroscopic and microscopic theory, con- 
nection of, 4, 169, 201, 208, 256; in secon- 
dary quantum theory, 315 

Macroscopic relativity theory, emendations 
required, 136, 279, 326; new light on, 190, 
327 

Macroscopic set, 83 

Magnetic field, energy due to, 227 ; strong and 
weak fields, 228 

Magnetic moment, 88, 89 

Mass, origin of, 201, 262, 268, 276 

Mass-constant, 230 

Mass-ratio of proton and electron, 3, 219,. 
304 



334 



Index 



Matrices, 16; non-commutative property, 17; 
components, 36; real and imaginary, 40; 
Pauli, 47; kinematical and electrical, 84; 
space-like and time-like, 96; double, 155 

Matrix representation, advantage of, 39, 96 

Matrix representing general ^-number 
(given in full), 42 

Measurables, 170, 324 

Measurement in strong fields, 81, 243 

Motastable states, 150 

Metrical tensor (^), measurement of, 81; 
identified with kinematical self energy 
tensor of comparison fluid, 196, 209 

Microscopic and macroscopic theory, con- 
nection of, 4, 169, 201, 208, 256; in secon- 
dary quantum theory, 315 

Microscopic vorticity, 189, 326 

Milner, S. R., 48 

Minimum equation, 44 

Minor complete set, 47 

Minus particles, 88, 90, 264, 319 

Mixed tensor, 14, 29, 54, 94 

Modifying factor, 107, 281 

Momentum and position, 162. See also 
Uncertainty principle 

Momentum operator, 120, 131; in practical 
units, 152; relation to energy operator, 
230 

Momentum vector, 63; complete, 65; non- 
algebraic, 120 

Monothetic, 75 

Multiplication, inner, outer and chain, 15, 
16; non-commutative, 17; of probabilities, 
154, 213, 309 

Multiplicity of frames, 204, 317 

Mutual energy invariant, 221, 260; pressure 
invariant, 221, 231 

Mutual energy tensor, 165, 205, 315 

Natural coordinates, 98 

Natural gauge, 136 

Nebulae, speed of recession, 279 

Negative and positive charges, 81, 88, 223, 

301; balancing of, 278 
Negative energy levels, 265 
Negations, 90, 265 
Neumann, J. v., 2, 36 
Neutral particle, 89; mass of, 219, 267; 

formula for mass (m ), 271, 275 
Neutral space-time, 80; sets, 83; comparison 

fluid, 206 
Neutron, 7 

Newman, M. H. A., 42 
Nonads, 42 
Non-algebraic phases, 97; wave functions, 

120 



Non-commutaipve multiplication, 17; ex- 
ponentials, &tr, 53; addition, 154; wave 
functions, 311 

Non-integrable gauge transformation, 137 / 
Normalised ^-numbers, 70; strain vectors, 

106 

Nuclei, atomic, 7; disruption by protons, 246 
Number of dimensions of space-time, 3, 325 
Number of particles in the universe, 272, 
316, 325; relation to number of eigenfunc- 
tions, 319 

Numerical constants of nature, 3 

Object system, 180 

Obsorvables, 170, 324 

Observer, 79; his selective interference, 328 

Outer product, 16; notation for, 17 

Packing ratio in helium, 10 

Parallel displacement, 122; generalised, 126; 
in comparison fluid, 182; in \> 288 

Parallel strain, 134 

Parameters of wave functions, 60; promo- 
tion to coordinate rank, 123, 289 

Partial comparison fluid, 196, 198, 207, 210 

Partial existence, 309, 320 

Partially occupied states, 311, 317 

Particles, 8; elementary, 74; neutral, 89; 
scalar, 231 ; number in the universe, 272, 
316, 325 

Particles, external and internal, 174, 221 ; of 
composite identity, 234, 285, 307, 315; in- 
distinguishable, 282, 285; excluding (Formi- 
Dirac) and non-excluding (Einstein-Bose), 
312 

Partition, scheme of, 183 

Pauli matrices, 47, 160, 248, 278, 310 

Pauli space, duplexity of, 321 

Pentad, 23, 35; real and imaginary members, 
40 

Pentadic part, 68 

Permanent, 59 

Permutation coordinate, 153, 228, 283, 284; 
relativity of, 291; recoil, 293 

Perturbation, 116, 141, 239, 249, 292 

Phase, 97 

Phase space, 95; singular, 112; double, 165; 
in CD frame, 166 

Philosophical outlook, 327 

Photons, 263, 315 

Physical reality, invariance of, 76 

Position and momentum, 162. See also 
Uncertainty principle 

Position vector, 73, 162; tensor, 164 

Positive and negative charges, 81, 88, 223, 
301; balancing of, 278 



Index 



335 



Positively saturated space, 81 

Positron, 00, 264 

Potential. See Electromagnetic potential 

I rtential energy, 245, 264 

Pressure in degenerate gas, 235, 236; relation 
to energy-density, 237, 241 ; change of zero 
reckoning, 258, 269 

Pressure invariant, 221, 231 

Probability, 107; distributions of ("states") 
32, 115; conditions of conservation, 119, 
12C; concerned in dynamical equations, 
142; multiplication of, 154,213; of identifi- 
cation, 284, 286; form of partial existence, 
309. See also Initial probability distribution 

Procrustean methods, 329 

Progressive waves, 247 

Projectivo relativity, 56 

Proper time. See Invariant time 

Pseudo-reciprocal, 26, 63, 112 

Pure wave tensors, 65, 68; standard forms, 
69; idempotent condition, 70, 73; physical 
importance of, 91 

Purity lost by contraction, 208 

Quadratic equation for mass, 219, 304 
Quadruple wave functions, 170, 297, 323 
Quantum numbers, ri, 147; u, 147; p, 149 
Quantum theory. See Microscopic theory and 

Wave mechanics 
Quarterspur, 23, 37 
Quotient of ^-numbers, 25 

Radiation, 7; fields of, 143, 262 

Real and imaginary matrices, 40 

Reality conditions, 86; in phase space, 97; 

for gauge transformation, 135 
Recession of the nebulae, 279 
Reciprocal, 26 
.Recoil of comparison fluid, 183; in time, 186, 

191; in *, 293; in r, 296 
Reduced mass, 174 
Reflection of a complete set, 28 
Relata and relations, 170, 181, 323 
Relative and absolute displacement, 109, 

183 

Relative coordinates, 170, 174; energy ten- 
sor, 197, 205; time, 217, 295; identity, 290; 

existence operator, 313 
Relativistic degeneracy, 235, 246, 253 
Relativity rotation (or transformation), 30, 

50, 165, 248; of identity, 283, 285 
Rest energy (or mass), origin of, 262, 268, 276 
Riemann-Christoffel matrix, 187 
Riemann-Christoffel tensor, 181; relation to 

energy tensor, 188; dual of, 192; cosmical, 

202 



Right- and left-handed frames, 48, 81, 85, 

172 
Rotation, 30; antiperpendicular, 51, 94; 

circular and hyperbolic, 76, 97, 217. See 

also Relativity rotation 

Saturated space, 81, 164 

Scalar particles, 231; mass-constant of, 236; 

system of N', 259 

Scalar product, 15; notation for, 17 
Scalar wave functions, 229; relation to 

vector wave functions, 247 
Schrodinger's wave mechanics, 75, 89, 143, 

229, 232 
&P-numbers, 35 
Second quantisation, 308 
Secondary quantum theory, 310, 315 
Self-consistent field, 260, 312 
Self -energy tensor, 162, 165, 205, 209, 315; 

complete, 163 

Solf -pressure and energy invariants, 221, 260 
Signature of space-time, 3, 326 
Simultaneity in internal space, 92, 110; not 

applicable to imaginary time, 295 
Single-valued wave functions, 150, 287 
Singular matrices and symbols, 18, 26; con- 
ditions for, 43, 44 
Singular phase space, 112; transformations, 

173, 308 

Sitter, W. de. See <U Sitter 
Six-dimensional group of rotations, 85 
Six-vector, 57 
Sommerfeld's formula for energy-levels, 147, 

302 

Sound waves, 245 

Space-like matrices, 96; in double frame, 165 
Space tensors, 15; vectors, 53; complete, 64; 

association with strain vectors, 104, 110 
Space-time, four dimensions of, 3, 325; 

goneses of macroscopic, 80 
Specification of particles, 196; energy fur- 
nished by, 197, 267; macroscopic, 201 
Spectral set, 72; analysis of algebraic num- 
. bers, 89, 249 

Spherical curvature (Gaussian), 78, 220 
Spherical space, 5, 55. See de Sitter and 

Einstein space-time 
Spin, 73, 88; momentum, 122; coordinates, 

221, 283; duplexity, 321 
Spur, 36 

Standard environment, 129, 143, 150, 180 
Standing waves, 234, 244; hamiltonian of, 

237 
States, 32, 115; degeneracy of, 124, 247, 289, 

292 
Statistical equilibrium, 141, 239 



336 



Index 



Steady state, 140; generalised, 143 
Stereographic coordinates, 101; application 

of, 218, 219 

Stereographic projection, 257 
Stern-Gerlach effect, 226, 278; in weak fields, 

228 

Straight cross, 161, 164 
Strain vector, 94; association with space 

vector, 104, 110 
Strain vector form of wave equation, 106, 

130 
Stream vector, 65; of elementary particle, 

88; connection with momentum, 122; 

algebraic, 248; in secondary quantum 

theory, 310 

Strong action principles, 242 
Strong and weak magnetic fields, 228 
Suffix coordinates, 283 
Suffixes, omission of, 16 
Summation convention, limited use, 22 
Symbolic calculus, 20, 125 
Symbolic factors, 37 
Symbolic frame, 29 
Systems A, R (cosmical and local), 266; A' 9 

B' t 270 

Temple, G., 27, 73, 145, 151 

Tensor, 13 

Tensor-density, 58 

Tetrode, H., 2 

Threshold energy, 262 

Time, distinctive character of, 75; in in- 
ternal states, 92; irreversibility of, 117, 
225; linkage of displacement in, 187, 192; 
treated differently from spatial displace- 
ment, 201. See also Invariant time 

Time, imaginary relative, 295 

Time-like matrices, 96; in double frame, 165 

Tolman, R. C., 256, 273, 326 

Transformations, etymological, 136 

Transformations, linear, 13; of wave tensors, 
17, 18; of complete orthogonal sets, 27; 
unitary, 43, 98; of strain vectors, 94; 
general symbolic, 125; of gauge, 134; non- 
integrable, 137; of double wave vectors, 
155 ; of double frames, 162 ; associated with 
interchange, 283, 287; of existence condi- 
tions, 308. See also Lorentz transformations 

Transitions, 185, 292 

Translations, 77; linkage of, 189 

Transpose, 19 



Triads, 23; coHugate, 23, 42, 47, 160 
Two particles, representations of, 167 

Uncertain energy tensor, 164, 203, 210 

Uncertainty principle, 74, 162, 169, 179, 191, 

. 275, 286, 329 

Unified theories, 4, 243 

Unit matrix, 18 

Unitary transformation, 43, 98 

Universe, 7; Einstein and de Sitter forms, 

211; analysis of Einstein universe, 256; 

expanding, 273, 279; number of particles 

in, 272, 316, 325 
Unspecified particles, 196 

Vector, 16; basic, 15, 54; space, 53, 54; 

strain, 94; double, 155 
Vector density, 58; three-dimensional, 59, 
105; discriminated by uncertainty prin- 
ciple, 74; by reality conditions, 86, 87 
Vector wave function, 60; relation to scalar 

wave function, 248 

Velocity of indistinguishable particles, 285 
Volume clement, 58, 99; contraction of, 213 
Volume of phase space, 99 ; of spherical space, 
257 

Wave equation, 1, 62, 106; differential equa- 
tion, 119, 130; for hydrogen, 144, 153, 298 
Wave functions, 60, 66; replacement of 
double by single, 212; scalar, 229; w-tuple, 
244; Jordan- Wigner, 310 
Wave mechanics, 1 ; method, not theory, 7, 
66, 256 ; importance of conception of states, 
32, 115; principles of approximation, 112, 
129, 249; a statistical theory, 239; modi- 
fied outlook of secondary theory, 309, 315 
Wave packets, 225, 263, 286; pockets, 263 
Wave tensor calculus, origin of, 2, 15 
Waves equivalent to curvature, 256 
Weak action principles, 242 
Weak and strong magnetic fields, 228 
Weyl's theory, 1, 5, 85, 135, 136, 252, 295, 
327. For the author's generalisation see 
Generalised field theory 
White dwarf stars, 235, 255 

Zanstra, H., 10 
Zariski, O., 36 

Zero reckoning of energy and pressure, 188, 
195, 197, 229, 238, 258, 266, 313 



CAMBRIDGE: PBINTED BY WALTER LEWIS, M.A., AT THE UNIVERSITY PRESS